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Features 


Hi 


igh Performance 


50 MHz single-phase clock 

50 MIPS peak execution rate 

Separate CPU data and instruction buses 
Full 32-bit internal and external architecture 
3-stage integer execution pipeline and IEEE 
floating-point execution unit with overlapped 
instruction fetch and decode operations 
On-chip IEEE floating-point execution unit 
Upward-compatible with CLIPPER C100 


Streamlined Instruction Set 


9 addressing modes 

Most frequently used instructions execute in one 
clock cycle 

Macro instructions for operating system support and 
optimal use of bus bandwidth 

Multiple programmable register sets for efficiency 
— 16 32-bit user registers 

— 16 32-bit supervisor registers 

— 8 64-bit floating-point registers 


8 K-Byte Total Instruction and Data Caches 


4 K-byte instruction cache 

4 K-byte data cache 

256 line two-way set-associative, 16-byte line size 
cache organization 

User-enabled instruction prefetch for maximum hit 
rate and performance of the pipeline 

Bus Watch for system data integrity 
Write-through, copy-back, and noncacheable 
caching policies on a per-page basis 


Memory Management 


Demand paged virtual memory 

4 G-byte virtual address space per process 

4 G-byte real memory address space 

Separate user and supervisor modes 

128 line two-way set-associative Translation 
Lookaside Buffer each for data and instructions 
Memory read, write, and execute access protection 
on a per-page basis 

Dynamic Translation Unit and page table update 


High-Speed and Flexible Bus 


High-bandwidth synchronous bus 


Byte, halfword, word, and quadword transfers 
Support for both big endian and little endian byte 
orderings 


InterrupvVException Processing 


Macro instructions for exception processing 
256 vectored interrupts with 16 priority levels 
Separate interrupt bus for high-speed interrupt 
processing 

20 predefined traps 

128 system calls 


Multiprocessing Support 


Bus Watch checks all caches in system for cached 
data 

Global writes access all caches and TLBs in system 
Global system clock synchronization 

Test-and-set operation 

Multiple caching strategies 


The CLIPPER C300 Compute Engine is an architectural- 
ly advanced, very high-performance CMOS 32-bit 
microprocessor module consisting of a CPU, two 
Cache/MMU chips, and clock control circuitry. The CPU 
includes an IEEE standard Floating-Point Execution Unit. 


The CLIPPER C300 is a Single Instruction/Single Data 
architecture with instruction prefetch overlapped on mul- 
tiple execution units. The basic instruction set is stream- 
lined and hardwired for maximum performance. 

Because the control section of the CPU is a hardwired 
logic state machine, rather than a microcoded engine, in- 
structions execute at a maximum rate of one per clock 
cycle. The CPU contains two 32-bit buses: one for data 
and one for instructions. Multistage pipelined instruction 
processing, combined with a dual cache/MMU design, 
permit concurrency at all stages of program execution. 

In addition, the integrated Floating-Point Unit executes 
instructions concurrently with the integer execution unit. 
A high-bandwidth synchronous bus architecture easily in- 
terfaces to high-speed peripherals, 1/O, and memory 
subsystems, and supports both big endian and little en- 
dian byte orderings. 


1. 


Introduction 


The CLIPPER C300 32-bit Microprocessor Compute En- 
gine (see Figure 1) consists of three CMOS VLSI chips 
together with a Clock Control chip. The VLSI chips are: 
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Figure 1 CLIPPER C300 Module Block Diagram 
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1) a high-performance, dual bus CPU/FPU, 2) an in- 
struction cache/MMU chip (I-CAMMU), and 3) a data 
cache/MMU chip (D-CAMMU). The CLIPPER Module in- 
terface is a 96-pin connector which is buffered through 
a set of user-supplied drivers. 


The CLIPPER Module interface signals comprise the 
CLIPPER Bus and consist of a 32-bit, multiplexed 
data/address bus, bus arbitration control, bus control, 
clock control, interrupt control, error signalling, diagnos- 
tics, and reset. 


1.1. CPU 

The CLIPPER CPU is a high-performance, full 32-bit in- 
ternal and external (via separate 32-bit instruction and 
data buses) processor with a load/store architecture. 
The CPU is highly pipelined for maximum instruction ex- 
ecution and contains a 32 x 32-bit general register file, 
two ALUs (one for integer execution and one for floating- 
point execution), a streamlined instruction set, a Macro 
Instruction Unit (for exception processing instructions, in- 
terrupt handling instructions, and macrocoded instruc- 
tions), and a complete Floating-Point Unit. Figures 2 


Figure 2 Simplified CPU Block Diagram 
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and 3 show simplified and detailed block diagrams of 
the CPU. 


1.1.1. Pipelining and Concurrency 

The CPU utilizes a fetch, decode, and execution 
pipeline as shown in Figure 4. The CPU two-stage in- 
struction control unit (see Figure 2) supports two 
instruction execution units that operate in parallel, per- 
mitting up to four instructions (three integer and one 
floating-point) to be in the execution stage concurrently. 
Instruction control (the upper pipeline) includes both 
fetch and decode; decode includes both resource 
management and issue. The parallel execution units 
(lower pipeline) execute integer and floating-point opera- 
tions concurrently. Program counter values accompany 
instructions through the upper pipeline for exception 
processing and branch control by the CPU. 


There are two stages of instruction fetch, namely, from 
memory to the instruction cache (ahead of actual CPU 
demand) and from the cache to the CPU's Instruction 
Buffer. The Instruction Buffer can hold up to four instruc- 
tions. Immediate values are sent from the Instruction 
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Buffer via the J register to an L register in the ALU 
pipeline, for use as address offsets or data values. The 
J register and L register stages are shown in Figure 3. 


Instruction decode and resource management are per- 
formed in the B stage. The B stage obtains instruction 
parcels from either the Instruction Buffer or the Macro 
Instruction Unit. Resource management is accomplished 
by comparing an instruction request for a resource 
against a table of resources busy. 


In the final stage of the upper pipeline (decoded and as- 
sembled instruction is in the C stage), instructions are is- 
sued for execution to the integer execution unit or the 

floating-point execution unit if no resource conflict exists. 


The lower pipeline consists of two parallel execution 
units, an integer execution unit and a floating-point ex- 
ecution unit. The integer execution pipeline has three 
stages. In the first stage, operands are read from the 
general register file. The general register file has three 
ports that operate concurrently in a single clock period; 
two ports are for reading and the third is for writing. 
Thus, two reads and a write may be performed in a 
single clock. In the second stage, the ALU output is writ- 
ten to the A register or directly to the D-CAMMIU inter- 


Figure 3 Detalied CPU/FPU Block Diagram 
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face. In the third stage, the contents of the A register 
are output to the FPU, the bypass mux (to the ALU), to 
the general register file or to the D-CAMMU interface. 


1.1.2. Integer Execution Unit 

The Integer Execution Unit executes all instructions ex- 
cept those handled by the FPU. It contains a register 
file (supervisor and user sets), a serial double-bit shifter, 
and a 32-bit Arithmetic Logic Unit (ALU). 


The ALU is used for address computation as well as 
data manipulation. Nine addressing modes are sup- 
ported, each requiring only one pass through the ALU. 


When the result output by the ALU is needed by the in- 
struction immediately following it, a feedback mecha- 
nism allows the result from the current ALU operation to 
be fed back into the ALU for the next operation. Ad- 
dresses and data for integer load and store operations 
are passed directly to the D-CAMMU interface, bypass- 
ing the A register. 


1.1.3. Floating-Point Execution Unit (FPU) 

An integrated Floating-Point Unit performs single- and 
double-precision floating-point operations concurrently 
with the integer execution unit, using its own ALU and 


INTEGER EXECUTION UNIT 
we eee ee ee ee ee ee ee ee ee ee 4 






Mca aa tae aa Je tee ey Se 


DATA BUS INTERFACE 










GOKAHEAD 


Os 


CLIPPER CPU/FPU 
BLOCK DIAGRAM 


Lee ee ee ee ee ee eee ee ee eee ee ee He eee 


4 


2 
AGSS-1-A 


This Material Copyrighted By Its Respective Manufacturer 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 





set of eight 64-bit registers. The FPU controller is tightly 
coupled to CPU issue logic to reduce overhead. 
Because the Floating-Point Unit is on the CPU chip, 
CLIPPER Bus accesses are usually not required. This 
reduces bus traffic and improves performance. 


All CLIPPER floating-point arithmetic operations support 
the IEEE 754 Standard. For more information, refer to 
Section 4, Floating-Point Unit. 


1.1.4. Macro Instruction Unit 

The Macro instruction (MI ROM) Unit stores instruction 
sequences of the basic hardwired instruction set. When 
a macro instruction is encountered in the instruction 
stream, an instruction sequence is read from the Mi 
ROM and inserted into the B stage of the upper 
pipeline. The width of the ROM word is such that the in- 
struction pipeline can be maintained at the maximum of 
one parcel (one halfword) every clock. When the MI se- 


Figure 4 CLIPPER Pipeline 
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quence ends, the instruction stream is switched back to 
the Instruction Buffer as the source. 


The MI Unit provides three types of instruction 
sequences: 


— Those that provide direct support for the operating 
system: for example, context switching and 
trap/interrupt entry and return; 


— Those that perform certain floating-point operations 
not directly implemented in the Floating-Point Unit: 
for example, single to double and double to 
single-precision floating-point conversions; 


— Commonly used complex instructions which are 
typically found in so-called complex instruction set 
computers: for example, character string 
manipulations. 
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Instructions from the M! ROM are provided with addition- 
al Mi register files, thus avoiding resource conflicts with 
the floating-point and general-purpose registers. 


1.2. CAMMU 

In addition to the CPU, the C300 Compute Engine 
Module includes two Cache/Memory Management Unit 
(CAMMU) chips, an Instruction Cache/MMU, and a 
Data Cache/MMU. The CAMMUs interface to the CPU 
via a high-speed, 32-bit internal module bus and inter- 
face to main memory and 1/O devices via the CLIPPER 
Bus. 


1.2.1. Instruction and Data Caches 

Two separate, 4 K-byte cache memories, one for data 
and one for instructions, act as transparent high-speed 
buffers between the CPU and main memory. Each 
cache is two-way set-associative, containing 256 quad- 
word lines of frequently used instructions or data. For 
fast CPU access, each cache also contains a virtual ad- 
dress cache consisting of a 16-byte buffer containing 
the quadword that was most recently accessed from the 
cache, and a register containing the virtual address of 
the quadword. 


Because most CPU fetches are satisfied directly from 
the cache, the access time of real memory has far less 
effect on total system performance. Minimizing fetches 
from main memory also reduces bus traffic and allows 
greater bandwidth for other bus masters or I/O 
processors. 


Bus Watch is the monitoring of the CLIPPER Bus trans- 
actions by the CAMMUs. It is used to ensure data con- 
sistency between the cache and main memory, and to 
ensure that the latest data is always transferred to an 
V/O device reading main memory. Bus Watch is 
transparent to software. 


A demand fetch algorithm is implemented in both the 
l-CAMMU and D-CAMMU. A prefetch algorithm is also 
implemented in the I-CAMMU; it can be enabled or dis- 
abled under program control. 


1.2.2. Memory Management Unit (MMU) 

The Memory Management Unit translates CPU virtual 
addresses to real addresses in one of three separate 
real spaces (I/O, Boot, or Main Memory) using transia- 


tion tables located in main memory. In order to minimize 
the time required to obtain these translations, an addi- 
tional two-way set-associative Translation Lookaside 
Buffer (TLB) in each CAMMU holds 128 of the most fre- 
quently used values from the translation tables for both 
instructions and data. 


When the TLB does not contain the required translation 
entry, the MMU fetches the required value from main 
memory and updates the TLB. 


The MMU also supports main memory access protec- 
tion (read, write, and execute). 


1.3. Clock Control Unit 

The CLIPPER C300 clock chip provides both internal 
clocks for use on the Module and external clocks which 
can be used for system clocking. The frequency of 
MCLK, one of the internal clocks not available to the 
user, determines the rate of operation of the CPU and 
CAMMUs. The two external clocks generated by the 
clock chip are BCLK and 2BCLK. BCLK is the CLIPPER 
Module bus clock. The BCLK frequency is the rate of 
operation of the CLIPPER bus. 2BCLK is twice the fre- 
quency of BCLK. With an externally supplied 100 MHz 
oscillator, MCLK is 50 MHz, and BCLK/2BCLK is either 
25/50 MHz or 12.5/25 MHz, depending on the state of 
the RATE control pin on the CLIPPER Bus. See Section 
9, CLIPPER Bus, for details. 


2. Memory Organization 

The real memory of a CLIPPER system is organized as 
a sequence of 32-bit words, each word consisting of 
four 8-bit bytes. Each byte is assigned a unique ad- 
dress ranging from 0 to 4,294,967,295 (4 G-bytes). 


By using virtual memory techniques, a CLIPPER system 
can appear to have a full 4 G-bytes of physical memory 
available to each user program. See Section 9, 
CLIPPER Bus, for details. 


There are three real address spaces defined in the 
CLIPPER architecture: 


-—— Main memory space 
— WO space 
— Boot space 
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Main memory, I/O space, and Boot space are acces- 
sible in both user and supervisor modes. The memory 
space accessed by a given address is determined by 
the System Tag associated with the page. 


The Hardwired Translation Lookaside Buffer (HTLB) is a 
feature of the CAMMU which guarantees TLB hits of 
special memory pages by the supervisor. The first four 
pages of real main memory space have HTLB entries in 
the CAMMUs, as do the first two pages of both /O 
space and Boot space. The HTLB is used in supervisor 
mode only. The HTLB is described in detail in Section 
7.2.2, Fixed Address Translation. CLIPPER's three 


Figure 5 Real Address Spaces—HTLB Mapping 
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memory spaces and the mapping of the HTLB are 
shown in Figure 5. 


2.1. Data Types 

The CLIPPER C300 architecture supports the primitive 
data types shown in Figure 6. There are signed and un- 
signed bytes, halfwords (16 bits), words (32 bits), and 
longwords (64 bits), as well as single-precision (32-bit) 
and double-precision (64-bit) IEEE Standard fioating- 
point numbers. 


Bit 0 is the least-significant bit of all data types, and bit 
numbers within a byte increase from right to left. 
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Figure 6 C300 Primitive Data Types Data should be aligned in memory, that is, data types 
should begin at an address that is a multiple of their 
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2.1.1. Representation of Data in Memory 

There are two ways in which multiple-byte data may be 
VALUE addressed in CLIPPER memory: in big endian order or 
in little endian order. With big endian ordering, the ad- 
dress of a multiple-byte data type corresponds to the ad- 
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3. Programming Model 
The basic programming model for the CLIPPER C300 is 
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3.1. Register Sets visor modes. The SSW is writable only in supervisor 
The CPU contains three sets of registers: 16 user mode; it should not be read in user mode to ensure 
registers (r0-r15), 16 supervisor registers (r0-r15) and 8 compatibility with future Intergraph CLIPPER products. 


floating-point registers (f0- f7). 
The Program Counter (PC) contains the address of the 


The user and floating-point registers are accessible in current instruction. This is interpreted as a virtual ad- 
both user and supervisor modes; the supervisor dress if CLIPPER is operating in mapped mode, and as 
registers are accessible only in supervisor mode. a physical address if CLIPPER is operating in un- 

mapped mode (see Section 3.3, Mapped and Un- 
There are two status and control registers: the Program mapped Addresses). The PC is accessible by both the 
Status Word (PSW) and the System Status Word user and the supervisor. 


(SSW). The PSW is accessible in both user and super- 


Figure 9 Program Status Word 
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3.1.1. User and Supervisor Registers 

The user and supervisor registers, r0-r15, are general- 
purpose, 32-bit registers. They are used for word (32- 
bit) and longword (64-bit) integer operations. 


Bytes and halfwords used in load and store instructions 
are sign- or zero-extended to 32 bits before they are put 
in registers. Longword values are stored in register 
pairs, with the least significant word in the even-num- 
bered register. When double-precision (64-bit) floating- 
point data types are moved to an integer register pair, 
the least-significant fraction occupies the even num- 
bered register. 


3.1.2. Floating-Point Registers 

The floating-point registers, f0-f7, are used by the FPU 
for floating-point instructions, which are executed concur- 
rently with instructions in the ALU. These 64-bit 

registers are used for floating-point operands in both 
single- and double-precision IEEE format. Single- 
precision operands stored in floating-point registers 

have zeros in the 29 least significant fraction bits and in 
the three most significant exponent bits. 


The integer multiply, divide, and mod instructions are ex- 
ecuted by the FPU, but use registers r0-r15 (user or su- 
pervisor). 


3.1.3. Program Status Word (PSW) 

The PSW, shown in Figure 9, contains flags which iden- 
tify and, together with the SSW, control a program's 
response to various exceptions resulting from integer 
and floating-point operations (see Section 6, Excep- 
tions). This register also indicates byte ordering, as 
described in Section 2.1, Data Types. 


On hardware reset, the trace trap (T) flag is cleared, the 
big endian (BIG) flag is set to reflect the state of the 
BIG control pin, and the remaining PSW bits are un- 
defined. 


C,V,Z,N: Condition Codes 

The condition codes are modified only by the register-to- 
register integer instructions, string instructions, floating 
comparison, and by directly writing the PSW. They are 
tested by the branch on condition instruction. 
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FX, FU, FD, FV, Fl: Floating-Point Exception Flags 
The floating-point exception flags are set by hardware 
on exceptions arising from floating-point operations, in 
accordance with the IEEE 754 Floating-Point Standard. 
Once set, they are cleared only by user software or, for 
those conditions for which the corresponding trap is 
enabled (j.e., when both EFT and the individual enable 
flag are set) by the trap handler. They are tested by the 
branch on floating exception instruction (see Section 
6.2.2, Floating-Point Arithmetic Trap Group, for more 
details). 


EFX, EFU, EFD, EFV, EFI: Enable Floating Flags 

The IEEE floating-point trap enable flags are set by 
software to request the result that would be given to a 
trap handler on an exception, rather than the IEEE 
default (no-trap) result. If the EFT bit is set, enabled ex- 
ceptions also cause traps. See Section 6.2.2, Floating- 
Point Arithmetic Trap Group, for a description of the use 
of this field by trap handler routines. 


EFT: Enable Floating Trap 

When set, the enable floating trap flag enables traps to 
occur whenever an exception is signalled by the FPU 
and that exception's trap enable flag is also set. When 
this bit is clear, floating-point traps are disabled and 
program execution continues regardless of the values of 
the floating trap enable flags. 


FR: Floating Rounding Mode 

The floating-point rounding mode field is set by software 
to select the IEEE rounding mode for floating-point 
operations. 


The default is round to nearest, in which the rounded 
result is the closest representable number to the exact 
result, with ties decided in favor of the representable 
number with zero as its least-significant fraction field bit. 


When rounding toward +0, the result is the format's 
value (possibly +0) closest to and no less than the in- 
finitely precise result. When rounding toward -, the 
result is the format's value (possibly -2) closest to and 
no greater than the infinitely precise result. When round- 
ing toward 0, the result is the format's value closest to 
and no greater in magnitude than the infinitely precise 
result. 
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BIG: Big Endian 

The big endian flag indicates the byte ordering of CLIP- 
PER memory, as described in Section 2.1.1, Represen- 
tation of Data in Memory. 


When CLIPPER is reset, this flag is set by hardware to 
the state of the BIG control pin. if BIG is tied to GND, 
BIG = 1; if BIG is tied to VCC, BIG = 0. This flag is read 
only; writes to this flag are ignored. 


T: Trace Trap Enable 

The trace trap enable flag is set by the user or super- 
visor to request a trace trap following execution of the 
next instruction. It is cleared by the user or supervisor 
to disable the trace. 


CTS: GPU Trap Status 

The CPU trap status field is set by the hardware to indi- 
cate the cause of a CPU-related trap (see Section 6.2, 
Traps). 


MTS: Memory Trap Status 

The memory trap status field is set by the hardware to 
indicate the cause of a memory-related trap (see Sec- 
tion 6.2, Traps). 


3.1.4. System Status Word (SSW) 

The SSW controls the CLIPPER C300's mode of opera- 
tion (user or supervisor) and provides status and control 
for program protection and the response to interrupts 
(see Figure 10). It may be written in supervisor mode 


Figure 10 System Status Word 


31 30 29 28 27 26 25 24 23 22 21 «+17 16 


only. Reset clears the following SSW flags: El, TP, M, 
U, K, KU, UU and P. The remaining flags are undefined 
except for 1D, which is hardwired. This represents un- 
mapped supervisor mode with all maskable interrupts 
disabled. 


The SSW is written using the movwp (move word to 
processor register) instruction. When used with the 
SSW, this instruction can take as its second operand 
either processor register 1 (ssw) or processor register 3 
(sswf). movwp using processor register 1 behaves like 
a branch instruction, causing the upper pipeline to be 
flushed. movwp with processor register 3 does not 
cause the pipeline to be flushed, is thus faster, but must 
only be used in cases where the modification of the 
SSW will not compromise the memory mapping of the 
subsequent code in the upper pipeline. That is, because 
the K, U, KU, and UU protection bits are compared with 
the PL field of the TLB or HTLB entry for memory ac- 
cess protection, a memory reference that would have 
failed may succeed or vice versa, or it may fail different- 
ly, or it may succeed for the wrong reasons. Therefore, 
processor register 3 may only be used when modifying 
the IN, IL, El, FRO, TP, ECM, KU and UU fiags; 
modifications of the M, K, U and P flags must use the 
movwp instruction with processor register 1. 


Descriptions of IN, IL, El, TP, and ECM are given below 
and in Section 6, Exceptions. M, KU, UU, K, U, and P 
are described below and in Section 7.2.1, Translation 
Lookaside Buffer (see Protection Level field description). 


9 8 7 


FIELD MEANING 


IN Interrupt number 
IL Interrupt level 

El Enable interrupts 
ID CPU ID 


FRD ‘Floating registers dirty 

TP Trace trap pending 

ECM _ Enable corrected memory 
error 
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FIELD MEANING 


M Mapped mode 
KU User protect key 
UU User data mode 


K Protect key 
U User mode 
P Previous mode 
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IN: Interrupt Number 

The interrupt number field is set by hardware (INTRAP 
and retl) and by software (movwp) to indicate the 
system's current interrupt number. If an interrupt of 
equal or higher priority occurs during the service of an 
interrupt, this value (along with the interrupt level) will 
be pushed on the stack, and this field will be updated 
with a new interrupt number value. Interrupt numbers 
are not prioritized. 


IL: Interrupt Level 

The interrupt level field is set by hardware (INTRAP and 
reti) and by software (movwp) to establish the system's 
current interrupt priority level. Only interrupts of equal or 
higher priority (equal or lesser value) than this value will 
be recognized. If an interrupt of equal or higher priority 
occurs during an interrupt service routine, this value will 
be pushed on the stack, and this field will be updated 
with the new interrupt level. 


El: Enable Interrupts 
The enable interrupt flag is set by software to enable in- 
terrupts. It is cleared by software to disable interrupts. 


ID: CPU ID 

This 8-bit field is hardwired to encode the version num- 
ber of the CLIPPER CPU/FPU. This field is read only; 
writes to this field are ignored. 


FRD: Floating Registers Dirty 

The floating register dirty flag is set by hardware when- 
ever a floating-point register is written. This flag may be 
cleared by software. Its purpose is to permit operating 
systems to reduce context switching overhead. 


TP: Trace Trap Pending 

The trace trap pending flag is automatically set by 
hardware whenever a trap or interrupt occurs during an 
instruction and the T flag is set. This ensures that the 
trace trap is taken immediately after the trap or interrupt 
handler has finished executing, and that a single instruc- 
tion is traced exactly once. 


On data page faults, the supervisor must clear TP 
before restarting the faulting instruction in order to en- 
sure that the instruction is traced exactly once. 
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ECM: Enable Corrected Memory Error Trap 

The enable corrected memory flag is set by software to 
request a trap whenever a corrected memory error oc- 
curs. When this flag is set, a logic low on the 
MSBE/RETRY signal line (indicating a single-bit 
memory error) causes a trap. 


M: Mapped Mode 

The mapped mode fiag is set by software to cause all 
address references to be mapped through the page 
tables. When set, virtual address (VA) to real address 
(RA) translation by the CAMMUs is enabled (mapped 
mode). When cleared, VA to RA translation by the 
CAMMUs is disabled (unmapped mode). The only ex- 
ceptions are the first eight pages when in supervisor 
mode. These pages are always mapped via the HTLB, 
regardiess of the state of this flag. 


U:User Mode 

The user mode flag is set by the supervisor to indicate 
user mode of operation and cleared to indicate super- 
visor mode of operation. 


K: Protect Key 

The protect key flag is set and cleared by the super- 
visor to select one of two sets of memory access protec- 
tion codes for memory access validation and protection 
during program execution. This flag is used for the ac- 
cess protection code selection in user mode, and in su- 
pervisor mode when the UU flag is clear (see Table 10 
in Section 7.2, Memory Management Unit). 


KU: User Protect Key 

The user protect key flag is set and cleared by the su- 
pervisor program to select one of two sets of memory 
access protection codes for memory access validation 
and protection during program execution. This flag is 
used for the access protection code selection only 
during supervisor program execution when the UU flag 
is set (see Table 10 in Section 7.2, Memory Manage- 
ment Uni. 


UU: User Data Mode 

The user data mode flag is set and cleared by the su- 
pervisor to select either supervisor or user data address 
space access when memory data is referenced in super- 
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visor mode, and to select either the K or KU key flags 
for selection of the access protection codes used during 
supervisor memory references. When the UU flag is set, 
supervisor data references access user data space, and 
the KU flag is used for access protection code selec- 
tion. When the UU flag is clear, supervisor data referen- 
ces access supervisor data space, and the K flag is 
used for access protection code selection. This flag is 
significant only in supervisor mode (see Table 10 in Sec- 
tion 7.2, Memory Management Unit ). 


P: Previous Mode 
The previous mode flag is copied from the U flag when- 
ever the INTRAP sequence is executed. 


3.2. Supervisor and User Modes of Operation 

The C300 Module has two modes of operation, user 
and supervisor, as selected by the SSW's U flag. User 
and supervisor modes are distinguished by the set of in- 
structions which they are permitted to execute, and by 
the registers and logical address space they are per- 
mitted to access. 


All instructions can be executed in supervisor mode. In- 
structions which can be executed only in supervisor 
mode are called privileged instructions. When a 

program in user mode attempts to execute these instruc- 
tions, a privileged instruction trap occurs. 


Figure 11 Address Mapping—Mapped/Unmapped Modes 










SUPERVISOR 
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Programs executing in user mode have access only to 
the user registers (r0-r15), floating-point registers (f0-f7), 
the PSW, and the PC. Supervisor mode programs have 
access to all programmer-accessible registers. 

Memory address spaces are distinct for user and super- 
visor modes. Different translation tables may be used 
for translating user and supervisor mode addresses, 
and the access protection provided by the memory 
management mechanism allows access by supervisor 
programs to memory locations inaccessible to user 
mode programs. 


Supervisor mode is entered only via the INTRAP se- 
quence, or when the system is reset. 


3.3. Mapped and Unmapped Addresses 

CLIPPER can operate in two modes: mapped and un- 
mapped. In mapped mode, the CAMMU translates user 
and supervisor virtual addresses to real addresses 
using the TLB or the HTLB (for supervisor virtual 
addresses 0 - 777F Hex); in unmapped mode, only the 
HTLB is used for translation. The mode is selected by 
the M (mapped mode) flag in the SSW. When this flag 
is set, CLIPPER operates in mapped mode; when this 
flag is clear, CLIPPER operates in unmapped mode. 
The two modes are shown in Figure 11. Virtual to real 
address translation is discussed in Section 7.2, Memory 
Management Unit.. 


USER 
eo 
UNMAPPED MAPPED 
MODE MODE 
UNMAPPED VA MAPPED 
VA=RA TO RA 
3 
Manufacturer 


4. Floating-Point Unit 

The C300 Floating-Point Unit (FPU) executes addition, 
subtraction, multiplication, and division operations con- 
forming to the IEEE Standard for Binary Floating-Point 
Arithmetic (ANSIAEEE Std 754-1985) in the single- and 
double-precision formats. In addition, the floating-point 
comparison operations are provided for both 
precisions. The floating-point execution unit also per- 
forms integer multiplication, division, and mod opera- 
tions. 


Comparisons of floating-point numbers can result in the 
familiar trichotomy of b < a, b = a, b <a, as well as the 
condition b .unordered. a, which arises when either b or 
a is a non-numeric value (NaN). Results of the com- 
parison are indicated in the PSW condition codes at the 
conclusion of a floating-point comparison. Conditional 
branch instructions allow these condition codes to be 
used. 


The floating-point execution unit performs one operation 
at a time, reusing internal resources over a number of 
CPU clocks in order to complete the operation, includ- 
ing the handling of special case operands and results 
mentioned below. 


4.1. Floating-Point Register Usage 

All of the floating-point arithmetic instructions are 
register-to-register operations, using the eight floating- 
point registers available to software. These registers are 
capable of holding either single or double format 
operands interchangeably. The floating-point registers 
may be directly loaded from memory or may be loaded 
by transfer from the integer register file. Storing of 
operands may be direct to memory or by transfer to the 
integer register file. Additional "scratch pad” registers, 
transparent to the user, are available to the Macro In- 
struction Unit. 


Integer multiplication, division, and mod operations are 
also register-to-register, but in this case the registers 
used are in the integer register file; no floating-point 
registers are involved. 


Because separate instructions are provided for single- 
and double-precision operations, a rounding precision 
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mode field is not required because the precision is 
defined by the instruction field. All four rounding modes 
called for in the Standard are provided by the FR field 
in the PSW. 


4.2. Floating-Point Exceptions and Traps 
Exceptional operands and operation results are handled 
in conformity with the requirements of the IEEE Stan- 
dard. The special operands include signalling and quiet 
Not-a-Number (NaN), signed infinities, signed zeros, 
and denormal numbers, as well as the wealth of ordi- 
nary normalized numbers. 


If the corresponding trap enable flag in the PSW is set, 
and the PSW’s floating-point trap group enable flag is 
also set, then a floating-point trap occurs. The CPU 
then invokes a program called a trap handler, which 
may be user-specified. When a trap handler is entered, 
the load floating status (loadfs) instruction can be ex- 
ecuted to acquire useful information about the instruc- 
tion causing the exception. Floating-point exceptions are 
discussed in greater detail in Section 6, Exceptions. 


4.3. FPU Software Initialization 
The IEEE Standard requires the following initial condi- 
tions: 


— The rounding mode must be round nearest. 
— The floating-point exception flags must all be cleared. 


— All floating-point traps must be disabled, and default 
results for all exceptions must be enabled. 


This initialized state is accomplished by clearing all FPU- 
related bits in the PSW. 


The contents of f0-f7 should be set to a known value. 
Some programming languages require that these values 
be initialized to zero. The [EEE Standard, on the other 
hand, provides for special reserved values and calls 
these NaN, or Not-A-Number. Whichever of these is 
chosen, this value should be created and loaded into 
each of the floating-point registers. 
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An example FPU initialization is as follows: 
loadq $0, r0 # Create zero 
movwp 10, psw # Load PSW with rounding 
# mode 00 (nearest) and 
# clear all exception 
# flags and trap enable bits 
loadi $0x7ffbad75,r1. # Load high half of hex 
# NaN 1.bad75a 
loadi = $0x00000000, rO # Load low half of NaN 


movid 0, f0 # Store in floating register 0 
movid ro, f1 # Store in floating register 1 
movid 10, f2 # Store in floating register 2 
movid 10, f3 # Store in floating register 3 
movid 10, f4 # Store in floating register 4 
movid 10, f5 # Store in floating register 5 
movid 0, f6 # Store in floating register 6 
movid 0, £7 # Store in floating register 7 


The NaN used in the initialization above is a quiet NaN. 
A quiet NaN propagates through arithmetic operations 
unchanged, except for the sign bit, which is undefined 
for NaNs. Thus, any user who operates on a register 
not yet defined will receive this NaN as a result. 


5. Instruction Set 

The CLIPPER C300 instruction set of 101 basic and 67 
macro instructions is streamlined for speed and the 
most effective use of the system's resources and 
register sets. This smaller, faster instruction set is espe- 
cially useful to high-level language compilers that op- 
timize register usage, branch timing for maximum 
speed, and pipeline sequencing. 


Memory access is by load/store instructions to minimize 
memory-dependent execution delays. All data opera- 
tions are performed on registers by hardwired instruc- 
tions. 


There are two units in the C300 CPU that execute in- 
structions: the Integer Execution Unit (IEU) and the 
Floating-Point Execution Unit (FPU). The integer instruc- 
tions (with the exception of integer multiplies and 
divides) are executed by the IEU. Floating-point instruc- 
tions (and the integer multiplies and divides) are ex- 
ecuted by the FPU. 


Most instructions are fetched from main memory. Each 
instruction is fetched (through the instruction cache), 


decoded, then executed, either by the IEU or by the 
FPU. The only exceptions are the macro instructions. 


A macro instruction opcode selects a sequence of in- 
structions in the macro instruction ROM (MI ROM). 
When a macro instruction is decoded, execution control 
is switched to the MI ROM, and the sequences of the 
macro instruction are executed. 


The instruction set is listed in Table 7. 


5.1. Instruction Formats 

The information encoded in each instruction specifies 
the operation to be performed, the type of operands to 
use (if any), and the location of the operands. The 
mnemonic and operands of the assembly language 
source statement determine the instruction format used. 


Most instructions require one or more operands in the 
source statement. These operands can be located in a 
register or in memory. For example, the loadb instruc- 
tion contains operands that reference memory and a 
register. If an operand is located in memory, the instruc- 
tion must calculate the address of the operand accord- 
ing to the address mode specified in the instruction 
format. 


An operand can also be encoded within the instruction. 
The immediate and quick instructions use this type of 
format for efficient operation. 


All instructions are constructed in multiples of haifwords 
called parcels (see the general instruction format 
below). Depending on the instruction format used, the 
size of an instruction varies from one to four parcels. 


usps tse 
63 43. 47 32_31 16 (15 687 90 


CE, 


FOURTH THIRD SECOND ®RST 
PARCEL PARCEL PARCEL PARCEL 





Figure 12 shows CLIPPER C300 instruction formats. 
Notice that the formats are divided into two main 
categories, non-memory referencing instructions (NO 
ADDRESS) and memory referencing instructions (WITH 
ADDRESS). 
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INSTRUCTION FORMATS - NO ADDRESS 
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Quick 
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Table 1 Instruction Set, by Function 


LOAD/STORE INSTRUCTIONS 





Load Byte Unsigned 
Load Double Floating 
Load Floating Status 
Load Halfword 

Load Halfword Unsigned 
Load Immediate 

Load Quick 

Load Single Floating 
Load Word 


Store Byte 

Store Double Floating 
Store Halfword 

Store Single Floating 
Store Word 


DATA MOVEMENT INSTRUCTIONS 





Move Double Floating 

Move Double to Longword 

Move Longword to Double 

Move Processor Register to Word 
Move Single Floating 

Move Supervisor to User (privileged) 
Move Single to Word ‘ 

Move User to Supervisor (privileged) 
Move Word 

Move Word to Processor Register 
Move Word to Single Floating 





ARITHMETIC INSTRUCTIONS 





Add Double Floating 
Add Immediate 

Add Quick 

Add Single Floating 
Add Word 

Add Word with Carry 


Subtract Double Floating 
Subtract Immediate 
Subtract Single Floating 
Subtract Word 

Subtract Word with Carry 


Multiply Double Floating 
Multiply Single Floating 
Multiply Word 

Multiply Word Unsigned 
Multiply Word Extended 


Divide Double Floating 
Divide Single Floating 

Divide Word 

Divide Word Unsigned 


Negate Double Floating 
Negate Single Floating 
Negate Word 


Modulus Word 
Modulus Word Unsigned 


Scale by, Double Floating 
Scale by, Single Floating 
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Table 1 Instruction Set, by Function (cont.) 
LOGICAL INSTRUCTIONS 





Exclusive-OR Immediate 
Exclusive-OR Word 


Not Quick 
Not Word 





CHARACTER STRING INSTRUCTIONS 





Compare Characters 
Initialize Characters 


Move Characters 





CONVERSION INSTRUCTIONS 


Convert Double to Single 

Convert Double to Word 

Convert Rounding Double to Word 
Convert Rounding Single to Word 
Convert Single to Double 

Convert Truncating Double to Word 
Convert Truncating Single to Word 
Convert Word to Double 

Convert Word to Single 





COMPARE AND TEST INSTRUCTIONS 





Compare Double Floating 
Compare Immediate 
Compare Quick 
Compare Single Floating 
Compare Word 


Test and Set 


SHIFT/ROTATE INSTRUCTION 





Shift Arithmetic Immediate 

Shift Arithmetic Longword 

Shift Arithmetic Longword Immediate 
Shift Arithmetic Word 

Shift Logical Immediate 

Shift Logical Longword 

Shift Logical Longword Immediate 
Shift Logical Word 


Rotate Immediate 

Rotate Longword 

Rotate Longword Immediate 
Rotate Word 


STACK MANIPULATION INSTRUCTIONS 


Pop Word 
Push Word 





Restore Registers fn-f7 
Restore User Registers (privileged) 
Restore Register m-r14 


Save Registers fn-f7 
Save User Registers (privileged) 
Save Registers m-r14 


CONTROL INSTRUCTIONS 


Branch on Condition 
Branch on Floating Exception 





Call Subroutine 
Call Supervisor 


No Operation 

Return From Subroutine 

Return From Interrupt (privileged) 
Trap on Floating Unordered 


Wait for Interrupt (privileged) 
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5.1.1. Instruction Formats—No Address 

Register 

The Register format is used for most instructions that 
take just one or two register arguments. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


movw rs, 6 


= 


The opcode specifies the interpretation of the R1 and 
R2 fields. Usually the R1 field contains the source 
operand register number, and R2 contains the destina- 
tion operand register number. For example, in the 
movew instruction, the R1 field contains the number of 
the single-precision floating-point register containing the 
source operand, and the R2 field contains the number 
of the general register in which to store the result. 


Quick 

The Quick format encodes constant, 4-bit unsigned 
source operands directly in the instruction. The quick 
value is always zero-filled at the left before use. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


loadqg $15 , r10 





ee 


Aga 


16-bit Immediate 

The 16-bit Immediate format encodes a 16-bit source 
operand constant directly in the instruction. The im- 
mediate value is always sign-extended before use. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


addi 


$17 , 4 








OPCODE 


IMMEDIATE 












A04s: 
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32-bit Immediate 
The 32-bit Immediate format encodes a constant, 32-bit 
source operand directly in the instruction. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


andi $Oxtifff , 6 







IMMEDIATE LOW 





IMMEDIATE HIGH 


Aga 


Control 

The Control format encodes up to 8 bits of a constant 
value that is used by several special instructions. For ex- 
ample, the byte operand specifies the system call num- 
ber in the calls instruction. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


calls $17 





Ags? 


Macro 

The Macro format is used by those instructions that are 
implemented as macros rather than directly in the 
hardware. The P bit in the opcode, bit 9 of the first in- 
struction parcel, selects a privileged macro. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


t3 7 


cnvsw 







‘ 
OPCODE P 0/00 










oooo00co00 A 





This Material Copyrighted By Its Respective Manufacturer 


Ane 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 





5.1.2. Instruction Formats—With Address 

The remaining instruction formats specify an address 
operand and a register operand. Several address for- 
mats, or modes, are provided to support typical high- 
level language operations. The address mode is 
selected first by the opcode (bit 8 of the first instruction 
parcel), and if necessary, by the AM field (bits <7:4> of 


Table 2 Memory Addressing Modes 


Memory Addressing Mode 


Relative 

Relative plus 12-bit displacement 
Relative plus 32-bit displacement 
16-bit Absolute 

32-bit Absolute 

PC Relative plus 16-bit displacement 
PC Relative plus 32-bit displacement 
Relative Indexed 

PC Indexed 


Notes: 
All displacements are signed. 
PC addresses the first parcel of the current instruction. 


the first instruction parcel). Displacements and absolute 
addresses are always sign-extended. 


The address modes used in the memory referencing in- 
structions are summarized in Table 2 and explained in 
the following pages. 


Address Formation 


Address — (R1) 

Address <- (R1) + 12-bit displacement 
Address < (R1) + 32-bit displacement 
Address < 16-bit displacement 
Address <- 32-bit displacement 
Address <~ (PC) + 16-bit displacement 
Address — (PC) + 32-bit displacement 
Address <— (R1) + (RX) 

Address <— (PC) + (RX) 


RX is any general register containing the index modifying the effect of the source register. 


21 


This Material Copyrighted By Its Respective Manufacturer 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 


Relative 
The Relative format uses the address in a register (R1) 


to compute an address. 


_ADDRESS FORMATION __ 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT 


loadw (r2) , 12 


A049 





Relative Plus 12-bit Displacement 

The Relative Plus 12-bit Displacement format uses the 
address in a register (R1), plus a signed 12-bit displace- 
ment, to compute an address. The displacement is sign- 
extended to 32 bits before the address calculation. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 








storw we , 4 (ep) 





3t 0 


ADORESS FROM REGISTER 


31 12 11 10 U) 


—— EXTEND SIGN s | DISPLACEMENT 


31 0 


(eh 
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Relative Plus 32-bit Displacement 

The Relative Pius 32-bit Displacement format uses the 
address in a register (R1), plus a signed 32-bit displace- 
ment, to compute an address. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADORESS FORMATION 











ee ee ee 





c 
DISPLACEMENT LOW a ° 
pL eeucmereon oe 
xu i] 


16-bit Absolute 

The 16-bit Absolute format uses the signed 16-bit ad- 
dress, which is sign-extended to 32 bits before use. Be- 
cause the address field is signed, the range of 
addresses that can be accessed with this format is -2"5 
< address < 2'5 -1, 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 


tests jock , m1 






16.15 14 


31 (] 
fp ereeeer |e] oon 


3 0 


(se 
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The 32-bit Absolute format uses the 32-bit displacement 
portion of the instruction as an address. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 


31 o 


a 





PC Relative Pius 16-bit Displacement 

The PC Relative Plus 16-bit Displacement format adds 
a signed 16-bit displacement to the contents of the 
Program Counter (PC) to compute an address. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 








31 o 


ADDRESS FROM PROGRAM COUNTER 


16 15 14 
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PC Relative Plus 32-bit Displacement 

The PC Relative Plus. 32-bit Displacement format adds 
a signed 32-bit displacement to the contents of the 
Program Counter (PC) to compute the address. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 


call 










31 9 
ADDRESS FROM PROGRAM COUNTER 


| DISPLACEMENT HIGH 
DISPLACEMENT LOW 












3 ) 
SIGNED DISPLACEMENT 





31 o 
ADDRESS 3 


Relative Indexed 

The Relative Indexed format uses the address in a 
register (R1), plus the contents of an index register 
(RX), to compute an address. 


EXAMPLE INSTRUCTION INSTRUCTION FORMAT ADDRESS FORMATION 








toadbu = [r3]_— (fp) 








31 0 
OPCODE 1 fuaa0 | om | ADDRESS FROM REGISTER 
peeeeeee[ = [= | i 






ADDRESS FROM REGISTER 


a 
= 
° 


DDRESS 


25 


This Material Copyrighted By Its Respective Manufacturer 


PC indexed 

The PC Indexed format adds the contents of an index 
register (RX) to the contents of the PC to compute an 
address. 


EXAMPLE INSTRUCTION INSTRUCTION PORMAT. 












[ao 


5.2. Instruction Set Summary 

Table 5 is a summary of the instruction set. Each instruc- 
tion is described by several columns in the table. The 
columns are as follows: 


Instruction Name 
The full name of the instruction. 


Syntax 

Assembler instruction name and operand formats. The 
left letter of the operand code specifies the operand’s 
type and size. The right letter of the operand code 
specifies the operand’s field within the instruction and 
its location in the machine (immediate value, register, 
memory, etc.). 


Operand Type 














Operand Field 


b byte 8 single floating 1 Ri a address 
h halfword d double floating {2 R2 _ b byte 
w word p processor register}q quick 

| longword immediate 





For example, the operand code w1 indicates a word 
operand in the general register whose number is en- 
coded in the Rit field of the instruction. The code sa indi- 
cates a single floating operand in the memory location 
whose address is given by one of the addressing 

modes in Section §.1.2, Instruction Formats —With Ad- 
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ADDRESS FORMATION 





dress. Quick and immediate operand types are always 
w because these directly encoded values are always 
zero or sign extended to a word before use. 


Opcode 

Hexadecimal opcode. Address format instructions use 
two opcodes; the first one listed is for relative mode, 
and the second is for all other addressing modes. 
Macro format instructions show the entire first parcel. 


Format 
Instruction format. See Section 5.1, Instruction Formats. 


Operation 

Basic operation of the instruction. The operations of 
complex instructions like move are simplified or ab- 
breviated. Fixed registers are given by name, e.g., r0, f1. 


CVZN 
Effect of the instruction on condition codes in the PSW. 
O = always set to 0. 
1 always set to 1. 
unaffected. 


* set according to the operation. 


Fi, FV, FD, FU, FX 
Effect of the instruction on the floating-point exception 
flags in the PSW. Same key as CVZN. 
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Traps 

Traps that can be caused by the instruction. 
Corrected Memory Error 
Uncorrectable Memory Error 
Data Alignment fault 

Page Fault 

Read Protect Fault 

Write Protect Fault 

Divide by Zero 

Illegal Operation 

Supervisor Only (privileged)Instruction 


o-OUsDU>rCcO 
gh un eu tou 


All instructions can cause traps from the Instruction 
Memory Trap group in the !-CAMMU (for example, an 


Table 3 Assembler Operand Syntax 


General registers. The even general 
registers address long operands. sp, fp, 
and ap are synonyms for r15, ri4, and 
r13. Not to be confused with R1 or R2, 
which are register fields within an 
instruction. 


fO.. f7 Floating registers. Each register may 
contain a single or double floating value. 

psw, ssw, | Processor registers 0, 1,and 3. 

sswf 

$n Quick, byte or immediate value. 

n Absolute address. 

n(rm) Relative or relative with displacoment 
address. n may be 0 or absent. 

[rx](rn) Relative indexed address. 

n(pe) PC relative address. 

or tn 

[rx}(pe) PC indexed address. 

label Absolute or PC relative address depend- 


ing on the circumstances. 
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execute protect fault), so these are not shown. Possible 
floating-point traps are indicated by an asterisk ( * ) in 
the FI..FX columns. 


The instruction operand codes described above also 
describe the syntax of each instruction operand. As- 
sembler operand syntax is given in Table 3 below. As- 
sembler instruction operands are generally given in 
source, destination order independent of their positions 
in the machine representation. Table 4 lists the 
operators used in the operation field. 


Table 4 Operators 


Notation Meaning 
Rotate operator 

sha Shift arithmetic operator 

shl Shift logical operator 
+ Add operator 
- Subtract operator 
x Multiply operator 
+ Divide operator 

mod Modulus operator 
~ Logical complement operator 
= Equal operator 

- Assignment operator 
& AND logical operator 
| OR logical operator 

@ Exelusive-OR logical operator 
() Contents of operand within 
{] Separators used to indicate value inside as 

a unit 

<> Bit field of previous value 
rs Indicates a range of values 
tT Exponentiation 
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Tabie 5 Instruction Operations 


FFFFF 
instruction Name Syntax Opcode Format Operation IVDUX CVZN Traps 
eee a OU 
Add Double Floating addd d1,d2 22 Register d2<— (d2) + (d1) Pe. 
Add immediate addi wiw2 83 immediate w2 — (w2) + wi ae | 
Add Quick addq waw2 82 Quick w2 < (w2) + wq tn ee 
Add Single Floating adds 81,82 20 Register s2<— (82) + (s1) eee ee Lee 
Add Word addw so w1,w2 80 Register w2 <— (w2) + (w1) CC idl 
Add Word with Carry addwoc wiw2 90 Register w2 < (w2) + (w1)+C ce ae 
And Immediate andi wiw2 8b Immediate w2 <- (w2) & wi se eee O08 | 
And Word andw wi,w2 68 Register w2 < (w2) & (w1) seeee OOS 
Branch on Conditional be ha 48,49 Address IF cond, PC < ha sees ee es Al 
Branch on Fioating Exception bf« ha 4c,4d Address iF cond, PC — ha eee ee ee AY 
Call Routine call w2,ha 44,45 Address w2 « (w2)— 4, (W2) <— (PC), ---- - -- A,P,W 
PC — ha 
Call Supervisor calls bb 12 Control trap 400 + 8 x bb<7:0> ten ee 
Compare Characters empe b4 Of Macro while [(10)#0] & (((r2))=((r1)), = te * CULPLR 
rO=length, ri=string!, r=string2 10 e (10) — 1, 71 <— (r1) + 1, 
2 — (12 +1 
Compare Double Floating cmpd d1,d2 27 Register (d2) — (d1) sees OOF 
Compare Immediate cmpi wiw2 a7 immediate (w2) — wi tee ee eee | 
Compare Quick cmpq  wa,w2 aé Quick (w2) - wq te ee 
Compare Single Floating cmps = 81,82 25 Register (s2) — (81) seeee OOFF 
Compare Word cmpw sow1,w2 a4 Register (w2) — (w1) ce 
Convert Double Floating to Single cnvds  d1,s2 b4 39 Macro 82 < (d1)} ee. ee eee 
Convert Double Floating to Word cnvdwsd1,w2 b4 34 Macro w2 — (d1) | le 
Convert Rounding Double to Word = cnvrdw ss d1,w2 b4 35 Macro w2 <— (d1) eon et 
Convert Rounding Single to Word = cnvrsw ss 81,w2 b4 31 Macro w2 <— (s1) eee 
Convert Single Floating to Double = cnvsd = 81,02 b4 38 Macro d2 — (81) *. . 
Convert Single Floating %2 Word cnvsw = 8 1,w2 b4 90 Macro w2 — (s1) eet 
Convert Truncating Double to Word cnvidw  d1.w2 b4 36 Macro w2 <— (d1) *- * 
Convert Truncating Single to Word cnvtsw 81,w2 b4 32 Macro w2 <— (s1) weet 
Convert Word to Double Floating  cnvwd = wi,d2 b4 37 Macro d2 — (wt) sree 
Convert Word to Single Floating cnvws w1,s2 b4 33 Macro 82 — (w1) ee 
Divide Double Floating divd d1,d2 2b Register d2 < (d2) + (d1) eetee 
Divide Single Floating divs 31,82 29 Register s2 <— (s2) + (s1) eeeee ee 
Divide Word diw = wiw2-s 9c Register w2 < (w2) + (wt) -+++. 0#£00D 
Divide Word Unsigned diwu wiw2 = 9e Register w2 <— (w2) + (w1) -eee- 0000 D0 
Initialize Characters inite b4 0e Macro while (r1)#0, see ee ee BW 
rO=length, risdest, r2=pattern (1) — (12<7:0>), 
0 <— (0) -— 1, 71 — (Ft) +1, 
2 — (r2) rot-8 
Load Address foada =—s ba, w2 62,63 Address w2 «ba oe | 
Load Byte loadb ba,w2 68,69 Address w2 «- (ba) cee + ee» CUA,P,R,I 
Load Byte Unsigned foadbu = ba,w2 6a,6b Address w2 « (ba) seek 2 ee CULAP,RI 
Load Double Floating loadd = da,d2. 66,67 Address d2 <— (da) wees fees CUAP,RI 
Load Floating Status loadfs wi,d2 b4 3f Macro wi — (FP PC), cee ee 
d2 <— (FP dest) 
Load Halfword joadh = ha.w2 6c,6d Address 9 w2 < (ha) tees ne ee CUAPRE 
Load Halfword Unsigned loadhu = ha,w2 6e,6f Address w2 < (ha) cee ne ee CUAPLRI 
Load Immediate loadi wi,w2 87 immediate w2 <— wi cee ee OOF8F 
Load Quick loadq 4 owaw2 86 Quick w2 — wq see ee 0040 
Load Single Floating loads = $a, s2 64,65 Address 82 «- (sa) wee ee es CUAP,RI 
Load Word loadw waw2 60,61 Address w2 < (wa) sees 4 es CULA,P,R,I 
28 
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Table 5 Instruction Operations (cont.) 





FFFFF 
Instruction Name Syntax Opcode Format Operation 1 VDUX CVZN Traps 
Modulus Word modw ss wi,w2 9d Register w2 — (w2) mod (w1) se eee 0000 
Modulus Word Unsigned modwu w1,w2 of Register w2 <— (w2) mod (w1) seees 090000 
Move Characters move b4 Od Macro while (r0) = 0, (r2) — ((r1)), «= +: + + + + C,U,P,.R.W 
rO=length, r1=source, r2=dest 0 — (10) — 1, ¢1 — (r1) +1, 
2 — (r2) +1 
Move Double Floating movd di,d2 26 Register d2 < (d1) toe nee 
Move Double Floating to Longword movd! 41,!2 2e Register l2 <— (d1) a 
Move Longword to Double Floating movid = 11,d2 2t Register d2 < (I1) see ee 
Move Processor Register to Word movpw = p1,w2 an Register w2 < (p1) 
Move Single Floating movs $1,82 24 Register s2 < (81) Fee ee 
Move Supervisor to User (privileged) movsu)  w1,w2 D6 01 Macro w2 — (w1) crane OOH4S 
Move Single Floating to Word movsw  81,w2 2c Register w2 < (s1) te ee 
Move User to Supervisor (privileged) movus wi,w2 b600 Macro w2 <— (w1) -- see 0044S 
Move Word movww wiw2 84 Register w2 < (w1) sees OO88 
Move Word to Processor Register movwp w2,pi 10 Register p1 <— (w2) oe oo 
Move Word to Single Floating movws) wi,s2 2d Register s2 — (w1) tte ee 
Multiply Double Floating muld d1,d2 2a Register d2 < (d2) x (d1) esas 
Multiply Single Floating muls $1,82 28 Register s2 < (s2) x (st) eh. we tne 
Multiply Word mulw = wiw2 98 Register w2 <— (w2) x (w1) seers 0400 
Multiply Word Unsigned mulwu wiw2 9a Register w2 <— (w2) x (w1) sense 0400 
Multiply Word Unsigned Extended = mutwux w1,l2 9b Register 12 — (w2) x (w1) seers 0400 
Multiply Word Extended mulwx wi,l2 99 Register I2 — (w2) x (w1) seers 0400 
Negate Double Floating negd d1,d2 b4 3b Macro d2 — — (di) a 
Negate Single Fioating negs 81,82 b4 3a Macro s2—-(81) «reese see 
Negate Word negw wi,w2 93 Register w2 < — (w1) ce ee 
No Operation noop bb 00 Control none te ne 
Not Quick notq wqw2 ae Quick we <- ~wq seeee OOO! 
Not Word notw wiw2 ac Register w2 <— ~(w1) sees 0088 
Or immediate ori wiw2 &f Immediate w2 — (w2) | wi seeee OCOeFT 
Or Word orw wi,w2 8c Register w2 — (w2) | (w1) sense 0088 
Pop Word popw = w1,w2 16 Register wi <— (wt) +4, sees 4 ees CUAP,R 
w2 <— ((w1) - 4) 
Push Word pushw = w2,w1 14 Register wi < (w1)-4, sree 2 te + APW 
(w1) <— (w2) 
Restore Registers fn .. f7 restdn b4 28 Macro fr... f7—((r98)).. ee - +++ C,UA,P,R 
Osns7 is ((r15) + 8 x [7-n)), 
b4 2F r15 < (r15) + 8 x [8—n] 
Restore User Registers (privileged) restur wi b6 03 Macro r0 .. r15 & ((w1)) .. see 4 eee CUAP,R,S 
((w1) + 60) 
Restore Registers rn .. r14 restwn b4 10 Macro mm... 114 & ((r5)) .. see 4 ees CULA,P,R 
Osnsi12 = ((r15) + 4 = [14—n), 
b41C r15 © (r15) + 4 x ([15—n] 
Retum From Routine ret w2 13 Register PC <— ((w2)) tonne oe ee + CULA,P,R 
w2 — (w2) +4 
Return From Interrupt (privileged) reti wi b6 04 Macro Restore SSW,PSWandPC.---- *«*#+#*# §& 
Rotate Immediate roti wiw2 3c Immediate w2 <— (w2) rot wi cee ee OOH] 
. Rotate Longword roti wil2 35 Register 12 < (I2) rot (w1) sre ee OOF* 
Rotate Longword immediate roti wi,l2 3d Immediate 12 — (12) rot wi tenes OOF] 
Rotate Word row wi1,w2 34 Register w2 <— (w2) rot (w1) seers OO8® 
Save Registers fn .. {7 savedn b4 20 Macro (r15) -8 x (8-n] .. soe ee oe ee A,PLW 
O<sns7 # (ri5) — 8 — (fn) .. (17), 
b4 27 r15 < (r15) — 8 x [8 — n} 
29 
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Table 5 Instruction Operations (cont.) 


FFFFF 
Instruction Name Syntax Opcode Format Operation IVDUX CVZN Traps 
Save User Registers (privileged) saveur wt b6 02 Macro (wi) — 4... (W1) — 64 & sees oe ees APW,S 
(r15) .. (r0) 
Save Registers rn .. 14 savewn b4 00 Macro (715) — 4x [15 —n] .. cee oe ees AYP 
Osnsi12 a (r15) — 4 € (rn) .. (714), 
b4 0C 115 — (r15) — 4.x [15 - n] 
Scale by, Double Floating scalbd wi,d2 643d Macro d2< (d2)x 20) ee OR ee 
Scale by, Single Floating scalbs wis2 b43c Macro s2« (s2)x 2” ee 
Shift Arithmetic immediate shai wiw2 38 Immediate w2 <— (w2) sha wi vee Oeve | 
Shift Arithmetic Longword shal wil2 31 Register [2 <— (12) sha (w1) cee Qe ee 
Shift Arithmetic Longword Immediate shali wi,i2 39 Immediate 12 — (12) sha wi teens Qeee | 
Shift Arithmetic Word shaw wiw2 30 Register w2 < (w2) sha (w1) canes Qe ee 
Shift Logical Immediate shli wiw2 8a Immediate w2 <— (w2) shi wi reece OO8F] 
Shift Logical Longword shi wi l2 33 Register 12 <— (12) shi (w1) sees OF 4F 
Shift Logical Longword Immediate _shili wi!2 3b Immediate '2 <— (12) shi wi nee ee OOF] 
Shift Logical Word shiw wiw2 32 Register w2 <— (w2) shi (w1) seers OOF F 
Store Byte storb w2,ba 78,79 Address ba <— (w2) see ee + ee > APWII 
Store Double Floating stord d2,da 76,77 Address da < (d2) sees ee es APWL 
Store Halfword storh w2,ha 7c,7d Address ha < (w2) wee te ee APWY 
Store Single Floating stors s2,sa 74,75 Address sa <- (82) secre oe eee API 
Store Word stow w2,wa 70,71 Address wa< (w2) eee ee ee APWI 
Subtract Doubie Floating subd d1,d2 23 Register d2 «- (d2) — (d1) 2 
Subtract Immediate subi wiw2 a3 immediate w2 <- (w2) — wi a i | 
Subtract Quick subq waq,w2 a2 Quick w2 < (w2) — wq ne ow dd 
Subtract Single Floating subs 81,82 21 Register s2 <— (s2) — (81) 2 
Subtract Word subw wi,w2 ad Register w2 <— (w2) — (w1) te ee 
Subtract Word with Cary subwo = w1,w2 91 Register w2 <- (w2) —(w1)—C cn ee 
Test and Set tsts wa,w2 72,73 Address w2<(wa),wae1  <--°° oe e+ CUA,P, 
RW 
Trap on Floating Unordered trapfn b4 3e Macro IF PSW<ZN> indicates I 
unordered, illegal instruction 
trap 
Wait for Interrupt (privileged) wait b6 05 Macro Wait for interrupt cee ee ee S 
Exclusive-OR Immediate xori wi,w2 ab immediate w2 <— (w2) ® wi sees OO84] 
Exclusive-OR Word xorw wi we a8 Register w2 <— (w2) @ (wt) cee ee OOFF 
Legend: 
PSW Flags Field Traps Field 
- = Flag not affected by instruction D = Divide-by-zero 
+ = Flag set according to operation | = legal instruction 
0 = Flag set to0 M = Memory fauit 
1 = Flag set to 1 P = Page fault 


R = Read protect fault 
S = Supervisor only (priviledged) instruction 
W = Write protect fault 
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Table 6A _ Integer Branch Conditions 






PSW Flags 
Condition 
Branch always 


ame | Compare R1:R2 


cle Less or Equal brge Greater or Equal 


A 


brit Less Than 






Result R2:0 


Greater Than 


Less or Equal 





Not Equal 


Greater Than Unsigned 
brgeu Greater or Equal Unsigned 
Less Than Unsigned 


bege 
jeene [rerewt 
| beltu | Less Than Unsigned 


Less or Equal Unsigned 
9 i+ | x{x[X| begtu Greater Than Unsigned 


Less or Equal Unsigned 


The R2 field of the branch on condition instruction 
selects the condition on which to branch. When a 
Condition choice of mnemonics is shown, use the ones begin- 
ning with be if the condition codes to be tested were 
set by a compare instruction. Use the mnemonics 
beginning with br if they were set by move or logical in- 
Overflow structions (those instructions that set only N or 2). 


Not Overflow 
Negative 
Not Negative 


Table 6B Floating Branch Conditions 











Condition 


Floating ANY exception 
Floating BAD result 
Reserved 


Floating Unordered 
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6. Exceptions 

The CLIPPER C300 architecture supports 404 excep- 
tion conditions: 20 hardware traps, 128 programmable 
supervisor call traps, and 256 vectored interrupts. 


Traps are exceptions recognized by the CPU during ex- 
ecution of single instructions (e.g., divide by zero, page 
fault). A trap causes all instructions in both the upper 
and lower pipelines to either be backed out or com- 
pleted in a manner consistent with program restart. 


Interrupts are events signalled by devices external to 
the C300 Module and input to the module via the inter- 
rupt pins. Interrupts are taken when the following condi- 
tions are met: 


— Interrupts are enabled. 


— The Interrupt Level (IVEC<7:4>) is less than or equal 
to the IL field in the SSW. 


— All instructions in the lower pipeline have finished 
executing. String instructions have either completed 
execution or have detected the interrupt and saved 
sufficient state information for continuation. 


— No traps are pending. 


A flow chart showing the necessary conditions for inter- 
rupts is shown in Figure 14. 


The address of the service routine for each trap, super- 
visor call, and interrupt is stored in an Exception Vector 
Table (see Table 7), located in the first real page of 
main memory. The Exception Vector Table (EVT) con- 
tains a two-word entry for each exception, consisting of 
the starting address of the exception’s service routine 
and an SSW value associated with the routine. Unas- 
signed EVT addresses are reserved for future use by In- 
tergraph and must be initialized to point to a valid 
handler routine. 


The priority of exceptions is the order shown in the EVT 
(in the order from highest to lowest priority), except that 
the trace trap has the lowest priority. The CLIPPER 
C300 Module’s internal priority logic ensures that excep- 
tion service is always granted to the highest priority 
event. 
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Table 7 Exception Vector Table 






Real Address 
(Hex) 
Data Memory Trap Group 


Description 






























108 Corrected Memory Error 
110 Uncorrectable Memory Error 
120 Alignment Fault 

128 Page Fault 

130 Read Protect Fault 

138 Write Protect Fault 


Floating-Point Arithmetic Trap Group 


180 Floating Inexact 

188 Floating Underflow 

190 Floating Divide by Zero 
1A0 Floating Overflow 

100 Floating Invalid Operation 


Integer Arithmetic Trap Group 
208 Integer Divided by Zero 
Instruction Memory Trap Group 


288 Corrected Memory Error 
290 Uncorrectable Memory Error 
2A0 Alignment Fault 

2A8 Page Fault 

2B0 Execute Protect Fault 


llega! Operation Trap Group 
300 INegal Operation 
308 Privileged Instruction 
Diagnostic Trap Group 








380 Trace Trap 
Supervisor Calis 

400 Supervisor Call 0 

408 Supervisor Call 1 

7F8 Supervisor Call 127 
Prioritized Interrupts: 

800 Non-Maskable Interrupt 

808 Interrupt Level 0 Number 1 


Interrupt Level O Number 2 










810 





878 
880 
888 


Interrupt Level O Number 15 
Interrupt Level 1 Number 0 
Interrupt Level 1 Number 1 


Interrupt Level 15 Number 15 


Manufacturer 
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6.1. INTRAP and reti Sequences 

Two macro instruction sequences, INTRAP and reti, 
manage the entry to and exit from both traps and inter- 
rupts. The INTRAP sequence performs a non-interrup- 
table context switch to supervisor mode, and then 
transfers control to the trap or interrupt handler. The reti 
sequence is an interrupt/trap returm, also non-interrupt- 
ible, which restores the system to the correct user or su- 
pervisor environment. 


During the INTRAP and reti sequences, all interrupts 
are disabled; traps are not disabled, but only serious 
system faults can occur, as explained below. 


The INTRAP sequence begins by saving the PC, SSW, 
and PSW on the supervisor stack as shown in Figure 
13. The saved PSW will have MTS or CTS set to indi- 
cate the cause of the trap. INTRAP then copies the 
SSW's user mode flag (U) into the previous mode flag 
(P). In order to access the Vector Table, INTRAP sets 
the user mode flag to supervisor mode and clears the 
protect key (K), user data mode (UU), and user protect 
key (KU). The PSW is cleared. 


The address of the required Exception Vector Table 
entry, V, is then obtained in one of three ways: 1) For 
traps and the non-maskable interrupt, the address is 
generated from internal trap logic. 2) For supervisor 
calls, the address is generated from the lower 7 bits of 
the instruction. This value is multiplied by 8 and 400H is 
added to it. 3) For priority interrupts, a number is read 
from the Interrupt Bus lines, IVEC<7:0>. This value is in- 
verted, multiplied by 8, and 800H is added to it. 
INTRAP uses V to obtain the new PC value and V + 4 
to obtain the associated SSW value. The new SSW 


Figure 13 Supervisor Stack After INTRAP 
HIGHER ADDRESSES 


L 


115 


LOWER ADDRESSES 
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value is transferred to the SSW, overwriting the pre- 
vious contents of SSW except for the previous mode 
flag (P), which is retained in order to indicate the mode 
of the interrupted program. INTRAP then exits, and con- 
trol is passed to the trap or interrupt service routine. 


After completing its service, the trap or interrupt handler 
executes the ret] sequence, which restores the PSW, 
SSW and PC to their contents prior to INTRAP. 


6.1.1. Faults During INTRAP and reti 

The occurrence of a trap during INTRAP or reti results 
in an Unrecoverable Fault (URF). The C300 halts in a 
controlled suspended state, drives the URF signal low 
as an alarm, and waits until restarted by the RESET sig- 
nal. (In the URF state, all inputs other than RESET are 
ignored.) 


To avoid the occurrence of a page fault during INTRAP 
or retl (and the resulting URF condition), the supervisor 
stack must always have a valid Page Table entry that 
permits both reading and writing. This will prevent page 
faults from occurring during INTRAP or reti, because 
the supervisor stack is the only memory area 
referenced by these sequences. 


6.2. Traps 

Traps are signalled in the CPU chip or by either of the 
CAMMUs. There are 20 predefined traps, shown in 
Table 7. 


Both conditional and unconditional traps are supported 
(see Table 8). Conditional traps are enabled by flags in 
the PSW and the CAMMU Control Register and occur 
only when enabled. Conditional traps that are disabled 
can be detected and handied by the executing program. 


Table 8 Conditional and Unconditional Traps 


Conditional Traps Unconditional Traps 


Corrected Memory Error | Uncorrectable Memory 

















Floating-Point Arithmetic Error 
Trap Group Page Fault 
Trace Protect Faults 





Privileged Instruction 
Itegal Operation 
Integer Divide by Zero 
Supervisor Call 


Alignment Fault 
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Return Address 
(Saved In Supervisor Stack) 


Faulting Instruction 
Next Instruction To Be Executed 
Next Instruction To Be Executed 
Faulting Instruction 

Faulting Instruction 

Following Instruction 
Following Instruction 


Table 9 Trap Handler Environment Summary 


When Trap Is Taken 


During Execution 
After Execution 

After Execution 

Before Execution 
Before Execution 
After Execution 
After Execution 





























Data Memory Trap Group 
Floating-Point Arithmetic Trap Group 
Integer Arithmetic Trap Group 
Instruction Memory Trap Group 
Illegal Operation Trap Group 
Diagnostic Trap Group 

Supervisor Call 















Traps may be generated at various stages of instruction in a predetermined location, where it may be accessed 
processing, as shown in Table 9. The CLIPPER C300's for maintenance by the operating system. 

internal trap logic ensures that the saved program 

counter points to the instruction at which the trapped The operating system may ignore indications of cor- 
program may be correctly restarted. rected memory errors (MSBE/RETRY) by clearing the 


ECM flag in the SSW. 
6.2.1. Data Memory Trap Group 


Data memory traps occur when the data cache inter- Alignment Fault 

face reports a fault. These traps cause the faulted in- An alignment fault occurs when the ATE flag in the 
struction, as well as subsequent instructions already in D-CAMMU's Control Register is set, and one of the 

the upper pipeline, to be backed out. following data accesses is attempted: 

Data memory traps are recorded in the PSW's memory — halfword (16-bit) access to an odd address (AD<0>=1) 
trap status (MTS) field. The MTS field is also used by 

the instruction memory trap group for the same fault — word (32-bit) access with AD<1:0>0, except for 
conditions. Interpretation is not ambiguous because in- accesses to CAMMU I/O space, where AD<1:0> are 
struction memory traps are deferred until data memory used to select TLB lines or CAMMU registers (see 
traps have been serviced, and they are serviced by dif- Section 7.6.6, CAMMU Register Access ) 


ferent trap handlers. 
— doubleword (64-bit) access with AD<2:0>20 
In the case of the pushw and popw instructions, the 





stack pointer is decremented or incremented in parallel -—— byte or halfword access to CAMMU I/O space 
with the instruction’s memory access. Thus, when a 
data memory trap occurs during a pushw or popw in- The address saved on the supervisor stack is the 
struction, the operating system must, before restarting program address of the instruction that caused the align- 
the program, restore the stack pointer to the value it ment trap. The virtual address of the data memory loca- 
had prior to the trapping instruction, i.e., decrement the tion that generated the trap is saved in the CAMMU's 
stack pointer by 4 for popw, or increment the stack Fault Register. 
pointer by 4 for pushw. 

Page Fault 
Corrected/Uncorrectable Memory Errors A page fault occurs when a program attempts to access 
Corrected and uncorrectable data memory errors are a page for which there is no valid entry in the currently 
detected by memory and communicated to the CAMMU assigned Page Directory or Page Tables. The operating 
via the two system bus signals, MSBE/RETRY and system uses this fault to allocate pages to user or super- 
MMBE respectively. It is the responsibility of memory to visor programs. The address saved on the supervisor 


save the real memory address of the location that failed 
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system uses this fault to allocate pages to user or super- 
visor programs. The address saved on the supervisor 
stack is the program address of the instruction that 
caused the page fault. The virtual address of the data 
memory location that generated the fault is saved in the 
CAMMU’s Fault register. 


Read/Write Protection Faults 

Read/write accesses to each page are validated by a 
comparison of the U, K, UU, and KU flags in the SSW 
with the protection code in the TLB or user page tables. 
When an access violation occurs, the address saved on 
the supervisor stack is the program address of the in- 
struction that caused the fault. The virtual address of 
the data memory location that generated the fault is 
saved in the CAMMU's Fault register. 


6.2.2. Floating-Point Arithmetic Trap Group 

There are five distinct floating-point exceptions which 
are specified in the IEEE Standard 754. These excep- 
tions are signalled by the FPU in the case of invalid 
operation, inexact result, overflow, underflow, or divide 
by zero. For each exception, there corresponds a fioat- 
ing-point exception flag in the PSW. The corresponding 
bit is set on any occurrence of the exception. 


In addition, for each exception there exists a floating- 
point trap enable flag. There is also a floating-point 
group trap enable flag. When an exception arises for 
which the individual trap enable flag is true and the 
group trap enable flag is true, then a floating-point trap 
is invoked and control is transferred to a user-specified 
trap handler. If the group trap enable is false, then the 
trap is not invoked. If the individual trap enable flag is 
false, then the trap is not invoked. 


For the underflow and overflow exceptions, the behavior 
of the FPU is determined by the values of the floating- 
point trap enable flags as specified in the Standard. In 
particular, overflows with the overflow trap disabled 
deliver infinity or max_real, whereas the result with the 
trap enabled is the normalized result with the exponent 
distorted, as discussed below. Underflows are handied 
similarly. 


The software knows which floating-point trap has oc- 
curred because each floating-point trap invokes a 
separate trap handler (each has its own entry in the Ex- 
ception Vector Table). It is not sufficient to examine the 
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floating-point exception flags, since the state of these im- 
mediately before executing the exceptional operation is 
generally unknown. 


Floating Overflow 

The floating overflow exception is signalled when the 
biased exponent of the result (after rounding) is greater 
than the largest finite representable exponent. With addi- 
tion and subtraction, overflow occurs when two large 
numbers are added. At least one of them must have a 
biased exponent of +126 (single-precision) or +1022 
(double-precision) and the fraction addition (or the sub- 
sequent rounding) has a carry out of the msb position. 
The overflow may coincide with the fraction sum being 
inexact, though this is not necessarily the case. With 
multiplication, overflow occurs if, after normalization and 
rounding, the product of two finite non-zero numbers 
has an exponent greater than +127 (single-precision) or 
+1023 (double-precision). Overflow for multiplication 
may be exact or inexact. 


If the EFV flag is set, the computed result is delivered 
to the destination with the normalized rounded fraction 
of the true result (though the delivered exponent is 
usually wrong because of missing additional leading bits 
in the exponent field). For single-precision overflows, if 
the biased exponent of the true result is 255, then 
biased exponent 255 is delivered to the destination. If 
the true biased exponent of the result is 256 . . 408, 
then the true biased exponent minus 256 is delivered to 
the destination. Note that this is not the exponent 
wrapped result called for by the IEEE 754 specification; 
the wrap must be adjusted by system software before 
delivery to a user's trap handler. This is done to allow 
the user to provide software that handles traps in an ap- 
plication-specific way. For double-precision, the overflow 
exponents (biased) lie in the range 2047 . . 3120. 
These are mapped to 2047 and 0 . . 1072 respectively. 
These must be adjusted by (3/4)x2"! (1536) to obtain 
the IEEE Standard wrapped exponent. 


If the EFV flag is clear, then the computed result is dis- 
carded, and the properly signed default value (infinity or 
max_real, depending on rounding mode) is delivered to 
the destination. Max_real is the maximum representable 
value in the given floating-point format; single 

max_real = 2'28 - 2!4. double_max_real = 2'0% - 2977, 
The floating inexact exception is also signalled. If the 
rounding mode is round toward zero, the value 
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delivered to the destination is the maximum finite repre- 
sentable number (max_real) with the appropriate sign. If 
the rounding mode is round toward + o, then a positive 
signed overflow is replaced with + 2, while a negative 
signed overflow is replaced by minus max_real. For 
round toward - o, a positive overflow goes to plus 
max_real, while a negative overflow goes to - ~. 


Floating Inexact 

The floating inexact exception is signalled when the 
result of an operation cannot be exactly represented in 
the precision of the destination. The result is rounded 
according to the rounding mode specified in the PSW 
so that it has the precision of the destination, and then 
the rounded result is delivered to the destination. 


Floating Underflow 

The conditions under which the floating underflow excep- 
tion condition is signalled differ according to the setting 
of the EFU flag. If EFU is set, the floating underflow ex- 
ception is signalled when the result of an operation 
(before rounding) has a biased exponent less than the 
minimum representable biased exponent for a normal- 
ized number. If the true biased exponent of the result is 
zero, then biased exponent zero is delivered to the des- 
tination. !f the true biased exponent is less than zero, 
then the exponent delivered to the destination is true 
biased exponent plus 256 (2048 for double.) The ex- 
ponent must be adjusted by system software before 
delivery to the program's trap handler in order to con- 
form to the IEEE 754 Specification. The range of under- 
flowed biased exponents for single-precision is 0. . 
-275; for double-precision the range is 0. . -1125. 


If the EFU flag is clear, then the underflowed fraction is 
right shifted as the exponent is incremented until the 
biased exponent equals one. At this point, the result has 
been restated as a denormal number. If this repre- 
sentation is exact, then no underflow exception is sig- 
nalled. If the representation is inexact, then the result is 
rounded and delivered to the destination, and both the 
inexact and underflow exceptions are signalled. 


Floating Divide by Zero 

The floating divide by zero exception is signalled when 
the divisor is zero and the dividend is non-zero and 
finite. If the dividend is also zero, the result is the 
default quiet NaN (all ones in the fraction and exponent 
fields), and the FI flag is set. If the dividend is infinite, 
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the result is infinite, and no condition flags are set. The 
default result is a correctly signed infinity. 


Fioating Invalid Operation 
The floating invalid operation exception is signalled in 
the following cases: 


1. One of the operands is a signalling NaN. 
2. Add/Subtract, magnitude subtraction of infinities: 
(+ 02) — (+ 9) 
or (+00) + (— 0) 
or (+) — (— ) 
or (+ 00) + (+ 0) 


3. Multiplication’ 
0 x 0 
or ox0O 
4. Division 
0+0 


Or co + oo 


The value written to the destination is always a NaN. 
The NaN is either the NaN operand (the second 
operand if both are NaNs) made quiet if it were 
signalling (by setting the msb of the explicit fraction 
field), or the default NaN created by the hardware. The 
default NaN is quiet, and its fraction field is all ones. 


6.2.3. Integer Arithmetic Trap Group 
The CPU trap status field in the saved PSW indicates 
the cause of the integer arithmetic trap. 


Integer Divide by Zero 

The integer divide by zero exception is signalled when 
an integer divide or mod instruction is executed with 
zero divisor. 


Integer divide by zero cannot be disabled. The result of 
the trapped instruction will not be written to the 
specified register. 


6.2.4. instruction Memory Trap Group 

instruction memory faults are detected and signalled by 
the instruction interface. These traps are not acted upon 
when first sensed, i.e., if a branch instruction or other 
sequence altering event occurs between the time that 
the instruction interface detects the trap condition and 
when that instruction arrives at the C stage, then the 
pending trap condition is cleared and the trap is 
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deferred. A deferred trap will not trap until it is ready to 
be issued for execution. If pre-empted by another trap, 
it may trap later if the code is restarted. The faulting in- 
struction has not yet entered the lower pipeline when 
the trap is taken. The program address saved is that of 
the faulting instruction. 


For instruction memory traps, the memory trap status 
(MTS) field in the saved PSW indicates the reported 
error. 


Corrected/Uncorrectable Memory Error 

Corrected and uncorrectable data memory errors are 
detected by memory and communicated to the CAMMU 
via the two system bus signals, MSBE/RETRY and 
MMBE respectively. It is the responsibility of the 
memory system to save the real memory address of the 
location that failed in a predetermined location in 
memory, where it can be accessed for maintenance by 
the operating system. 








The operating system may ignore indications of cor- 
rected memory errors by clearing the ECM flag in the 
SSW. 


Alignment Fault 

An alignment fault occurs when the ATE bit in the 
|-CAMMU’s Control Register is set to 1, and an instruc- 
tion fetch is attempted from an odd address in memory 
(where AD<0>=1). 


The address saved on the supervisor stack is the 
program address of the instruction that caused the align- 
ment trap. The virtual address of the memory location 
that generated the trap is saved in the CAMMU's Fault 
Register. 


Page Fault 

A page fault occurs when a program tries to access a 
page for which there is no valid entry in the currently as- 
signed Page Directory or Page Tables. The operating 
system uses this fault to allocate pages to user and su- 
pervisor programs. The address saved on the super- 
visor stack is the program address of the instruction that 
caused the page fault. The virtual address of the 
memory location that caused the fault is saved in the 
CAMMU's Fault register. (The two addresses may differ 
for multiple-parcel instructions.) 
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Execute Protect Fault 

Instruction fetches from each page are validated by a 
comparison of the U, K, UU and KU flags in the SSW 
with the protection level in the TLB or user's page 
tables. When an instruction fetch violation occurs, the 
address saved on the supervisor stack is the program 
address of the instruction that caused the fault. The vir- 
tual address of the memory location that caused the 
fault is saved in the CAMMU's Fault register. 


6.2.5. Illegal Operation Trap Group 

Illegal operation traps are taken before the instruction is 
executed. The program address saved on the super- 
visor stack is the address of the instruction which 
caused the trap. The CPU trap status field in the saved 
PSW indicates the type of trap. 


iHegal Operation Fault 

An illegal operation trap results from the attempted ex- 
ecution of any undefined instruction opcode or the occur- 
rence of an addressing mode which is not specifically 
allowed. 


Privileged Instruction Fault 
A privileged instruction trap occurs when a privileged 
macro instruction is encountered in user mode. 


6.2.6. Diagnostic Trap Group 


Trace Trap 

Unless pre-empted by another trap or interrupt, the 
trace trap occurs following the execution of an instruc- 
tion whenever the PSW's T (trace trap enable) flag is 
set. For traced instructions which are interrupted or 
cause traps, the TP flag is set by hardware when the in- 
terrupt or trap occurs to ensure that the trace trap will 
occur immediately after the interrupt or other trap has 
been serviced. In the case of data page faults, TP must 
be cleared by the supervisor before restarting the fault- 
ing instruction to ensure that the instruction is traced 
exactly once. 


M! ROM sequences are treated as a single instruction 
for trace purposes so that the entire sequence executes 
before the trace trap is taken. 


At the time of the trap, the CPU trap status field in the 
saved PSW indicates that a trace trap has occurred. 
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The saved PC is the address of the instruction following 
the instruction that caused the trace trap. 


6.2.7. Supervisor Calls 

A supervisor call is an instruction executed as a trap, 
and is made using the calls instruction. Its purpose is to 
provide controlled access to system-level functions. 
There are 128 supervisor call codes, with separate Vec- 
tor Table entries for each. The PC value saved on the 
stack is the address of the instruction following the calls 
instruction. 


6.2.8. Multiple Traps 

Only traps in the data memory and floating-point groups 
can be signalled at the same time. CLIPPER internal 
trap logic permits correct recovery of both faulting in- 
structions. INTRAP transfers control to the floating-point 
trap handler, and the leadfs instruction can be used to 
access the floating-point instruction that caused the 
trap. The MTS field in the saved PSW may be read by 
the floating-point trap handler to determine which data 
memory trap occurred. 


6.3. Interrupts 

The C300 supports 16 prioritized interrupt levels, with 
each level containing interrupt numbers of equal priority. 
Level 0 (highest priority) contains 15 numbers; levels 1- 
15 each contain 16 numbers. In addition to the 16 inter- 
rupt levels, there is a non-maskable interrupt which has 
a higher priority than all interrupt levels and cannot be 
disabled by software. Level 0 Number 0 vectors to the 
NMI interrupt handler. 


The CPU contains the logic necessary to arbitrate inter- 
rupt requests according to the priority of the interrupt 
level. The interrupt level currently being processed is 
stored in the Interrupt Level (IL) field of the SSW. The 
CPU accepts interrupts only for interrupt levels of equal 
or higher priority than the current interrupt level. 


Interrupts are serviced between instructions, that is, in- 
terrupt requests are not acknowledged until instructions 
in the lower pipeline have finished executing, any result- 
ing traps have been serviced, and memory transactions 
have concluded. Thus, interrupts are not normally per- 
mitted during a macro sequence, which is considered a 
single instruction. However, some macro sequences (for 
example, the string instructions) permit interrupts peri- 
odically during their execution. 
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6.3.1. Maskable Interrupt Request/Acknowledge 
Protocol 

Priority interrupts are requested by the activation of the 

IRQ input line and the assertion of the vector number 

on IVEC<7:0>. The vector number includes the interrupt 

level on IVEC<7:4> and the interrupt number on 

IVEC<3:0>. 








An interrupt request will be acknowledged by the CPU if 
interrupts are enabled (the interrupt enable flag in SSW 
is set) and the interrupt level (IVEC<7:4>) is of equal or 
higher priority than the interrupt level contained in the 
SSW's Interrupt Level (IL) field. To maximize interrupt 
responsiveness following the assertion of TRG and 
IVEC, the interrupt level can change to higher priority 
on any BCLK until IRQ is released. See Figure 14. 


The CPU samples IRQ on the rising edge of every 
BCLK if interrupts are enabled and the priority condition 
is met. The CPU then enters the pre-trap state, in which 
the execution pipeline is emptied by withholding issue of 
the instruction in the issue and control phase. The in- 
structions in the execution pipeline complete executing: 
if their execution causes a trap to be signalled, the inter- 
rupt is deferred and the (higher priority) trap is serviced. 
The CPU then asserts IACK. 


The CPU latches the interrupt number and level on the 
BCLK following the release of IRQ, and releases [ACK 
during the following BCLK. 





The maskable interrupt request/acknowledge timing is 
shown in Figure 59. See also Section 9.4.8, Interrupt 
Bus. 


6.3.2. Non-Maskable Interrupt 

The non-maskable interrupt is signalled on the NMI 
input to the CPU which is sampled on the rising edge of 
every BCLK. An active low on NMI greater than the 
BCLK period will trigger this interrupt. NMI remai remains ac-_ 
tive until acknowledged by the CPU on NMIACK. If NMI 
is asserted after another interrupt has already been ac- 
knowledged, the non-maskable interrupt is serviced 
after completion of the INTRAP sequence for the ac- 
knowledged interrupt. The NMi request/acknowledge 
timing is shown in Figure 60. See also Section 9.4.8, In- 
terrupt Bus. 
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Figure 14 Interrupt Fiow Diagram 
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7. Cache and MMU 

The CLIPPER C300 contains two Cache/Memory 
Management Unit (MMU) combination VLSI chips called 
CAMMUs which are designed to optimize CLIPPER per- 
formance. 


Each CAMMU contains a 4 K-byte data cache, anda 
memory management unit which translates CPU 32-bit 
virtual addresses into 32-bit real addresses. One 
CAMMU is used for CPU instruction fetching and cach- 
ing and is interfaced to the CPU Instruction Bus; the 
second CAMMU is used for CPU data transfers and 
caching and is interfaced to the CPU Data Bus. Both 
CAMMuUs also interface to main memory and 1/O 
devices via the CLIPPER Bus. 


The two CAMMUs are functionally identical, but each is 
hardware programmed via an external chip pin for use 
as either an instruction CAMMU (I-CAMMU) or a data 
CAMMU (D-CAMMU). 


Figure 15 CAMMU Interface 
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The CAMMUs feature several caching policy and Bus 
Watch options which allow optimum performance 
tailored to specific applications. A prefetch option is 
available for the -CAMMU; and fixed address transla- 
tion is used in both the -CAMMU and the D-CAMMU 
for guaranteed access of selected locations in main 
memory, Boot, and I/O spaces. In addition, CAMMU in- 
ternal registers and register fields are easily accessed 
for efficient CAMMU configuration and control. 


7.1. Functional Overview 

The two main functional units of the CAMMU are the 
cache and the Memory Management Unit (MMU), with 
the MMU comprised of the Dynamic Translation Unit 
(DTU), the Translation Lookaside Buffer (TLB), and the 
Hardwired Translation Lookaside Buffer (HTLB) (see 
Figure 15). The CAMMU also utilizes a cache control 
unit which controls CAMMU data fetches from main 
memory. 
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Figure 16 Basic CAMMU Functional Flow CAMMU operation begins when the CPU asserts a vir- 
tual address on the CPU-CAMMU address/data bus. 
The task of the CAMMU is to translate the CPU virtual 
address (bits <31:12>) into a real address and to use 
the translated real address to find the data. 


The CAMMU compares the virtual address with a virtual 
address of the data stored in a 16-byte (Quadword) Buf- 
fer containing the most recently accessed cache line. If 
there is a match, the data is fetched directly from the 
Quadword Buffer and no additional cache or TLB action 
is performed. If there is no Quadword Buffer match, the 
CAMMU attempts to translate the address by using the 
TLB, which is a look-up table containing Virtual Address 
Tags and associated Real Address fields which point to 
locations in the cache. !f the CAMMU finds a Virtual Ad- 
dress Tag in the TLB which matches the CPU virtual ad- 
dress, it compares the associated Real Address field in 
the TLB with the Real Address fields of a cache line set, 


ACCESS TLB already selected by virtual address bits <10:4>, to deter- 
Constnncen mine whether the data is in the cache. If the data is not 
in the cache, the CAMMU accesses main memory for 
the data. 


If the CAMMU cannot find a matching Virtual Address 
Tag in the TLB, it invokes the DTU to search declared 
blocks of main memory (Page Directory Tables and 
Page Tables) in an attempt to translate the virtual ad- 
dress. 


The DTU, upon successful translation of the virtual ad- 
dress, updates the TLB with the new Virtual Address 
Tag/real address association. The GAMMU then con- 
tinues with data access. If the DTU cannot find the valid 
translation in main memory, the CAMMU asserts a CPU 
page fault trap for resolution by the operating system. 


Each CAMMU cache consists of 256 quadwords of data 
(4 K-bytes) with associated Real Address Tags in a con- 
figuration similar to the TLB. The CAMMU searches the 
Quadword Buffer and the onboard cache first, then 
main memory for addressed data locations if required. 


TRANSFER A basic logic flow of CAMMU operation is shown in 
DATAINSTR Figure 16. 
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Figure 17 CPU Virtual Address Format 
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Figure 18 Simplified CAMMU Block Diagram 
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Figure 17 depicts the format of the CPU virtual address 
and indicates how the various virtual address fields are 
used by the CAMMU. Figures 18 and 19 show CAMMU 
operation. These figures should be referred to while 
reading the following CAMMU descriptions. 


7.2. Memory Management Unit (MMU) 

The Memory Management Unit translates CPU virtual 
addresses into real addresses and supports address 
space access protection by the operating system on a 
per-page basis. 


Figure 19 CAMMU Block Diagram 
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Address translation is executed by three functional units 
within the MMU: the Translation Lookaside Buffer (TLB), 
the Hardwired TLB (HTLB), and the Dynamic Transla- ~ 
tion Unit (DTU). Address space access protection and 
memory management support are performed by logic 
within the MMU which utilizes system tags and protec- 
tion codes associated with the virtual memory pages. 


7.2.1. Translation Lookaside Buffer (TLB) 
The TLB is a two-way set-associative memory array 
that is used by the CAMMU for fast, on-board virtual ad- 
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dress to real address translation. It consists of 64 sets 
of lines, with each set consisting of a W and an X com- 
partment line, and an associated U flag (see Figure 20). 
The CAMMU uses bits <17:12> of the CPU virtual ad- 
dress to select a TLB line set, then compares bits 
<31:18> of the virtual address with the VA (Virtual Ad- 
dress) Tag of both the W and X Compartment lines of 
the selected set. 


If there is a match, and if the appropriate access protec- 
tion code allows the data/instruction access, the 20-bit 
RA (Real Address) field of the matching W or X line is 
multiplexed and transferred to the cache as real ad- 
dress bits <31:12> where they, along with virtual/real ad- 
dress bit 11 (this bit is not translated), are used to 
validate the data. 


If there is no TLB match, the DTU attempts address 


translation, as explained in Section 7.2.3, Dynamic 
Translation Unit. 


Figure 20 CLIPPER TLB 
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TLB Line Description 

TLB line format is shown in Figure 21. Equivalent RA, 
ST, PL, D and R flags are located in the Page Tables. 
The CAMMU ensures that the D and R flags in the 
Page Table entries are updated with flag changes in the 
TLB. 


The TLB line field definitions are as follows: 


SV: Supervisor Valid 

The SV flag when set indicates that the TLB line is 
used for address translation only during supervisor 
mode operation. All TLB SV flags can be cleared as a 
group by writing to the CAMMU Reset Register and by 
a hardware reset. 


UV: User Valid 

The UV flag when set indicates that the TLB line is 
used for address translation only during user mode 
operation. All TLB UV flags can be cleared as a group 
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by writing to the CAMMU Reset Register and bya 
hardware reset. 


VA: Virtual Address Tag 

This 14-bit field is used for W or X line selection once 
TLB set selection is complete. The VA Tag of each line 
is compared with CPU virtual address bits <31:18>. If 
there is a match, the matching line is used for the ad- 
dress translation. 


RA: Real Address 

This 20-bit field is used as real address bits <31:12> 
once the TLB line has been matched, and access 
protection and validation checks have been completed 
(see UV, SV, and PL descriptions). 


ST: System Tag 

This is a three-bit field which identifies the caching type, 
caching policy, and address space associated with the 
page referenced by the TLB line as follows: 

ST<2:0> Description 

private, write-through, main memory space 
shared, write-through, main memory space 
private, copy-back, main memory space 
noncacheable, main memory space 
noncacheable, I/O space 

noncacheabie, Boot space 

cache purge 

nencacheable, main memory space, slave 
VO mode 


NO hk ON = 0 


Figure 21 TLB Line Format and Description 
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This field is used only in mapped mode. In unmapped 
mode, the UST (Unmapped System Tag) field in the 
CAMMU Control Register is used, as described in 
Section 7.6.4. 


The System Tag is asserted on CLIPPER Bus lines 
TG<2:0> during CAMMU extemal accesses. Further in- 
formation on the System Tag field is provided in Section 
7.4, System Tag. 


S: System Reserved 
This is a general-purpose, two-bit field reserved for use 
by the operating system. 


PL: Protection Level 

Associated with each virtual address is a function code 
asserted by the CPU which identifies each memory 
reference as a read, write, or instruction fetch operation, 
and indicates the states of the U, UU, K, and KU flags 
in the CPU's SSW. These SSW flags indicate whether 
the memory access is by the supervisor or by a user, 
which protect key is to be used for access verification 
(K or KU), and the state of the key. The CAMMU com- 
pares the function code with the 4-bit PL field of a 
selected TLB line to determine whether read access, 
write access, and instruction fetching is allowed. 


The Protection Level fields are used only for CPU 
mapped addresses (addresses asserted while the 
mapped mode bit of the SSW is set). Unmapped 

addresses invoke no access protection. 
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Table 10 shows allowed accesses according to the the operating system uses this flag as part of a main 
SSW’s K, U, UU, and KU flags, and the PL field. memory page replacement algorithm by periodically 
clearing all the TLB R flags via the Reset Register (see 
D: Dirty Flag Section 7.6.5, Reset Register), then allowing them to be 
The Dirty flag is set by the CAMMU to indicate that the set during normal program execution. When the operat- 
4 K-byte page in main memory referenced by the TLB ing system replaces a main memory page, it selects an 
line has been altered. Typically the operating system “unreferenced” page for replacement based on the 
uses this flag to determine whether the referenced data states of the R flags. 
page must be copied to secondary storage (such as a 
hard disk) when the data in the page is replaced. U: Used Flag 
Associated with each TLB line set is a U (Used) flag 
R: Referenced Flag which is set by the CAMMU to indicate that the W line 
The CAMMU sets the R flag to indicate that the page of the set was last accessed and cleared to indicate 
associated with the line has been accessed. Typically that the X line was last accessed. When a new line has 
to be entered into the TLB as a result of a TLB miss, 
Table 10 Page Access Encoding the least recently used line in the selected set is 





Supervisor Mode User Mode replaced based on the state of this flag. 
(U=0) (U=1) 


D- and 
I-CAMMU 


7.2.2. Fixed Address Translation 

The CAMMuUs feature hardwired TLB lines which en- 
sure TLB hits of special memory pages by both mapped 
and unmapped addresses while the CPU is executing in 
supervisor mode. These hardwired entries eliminate 
page faults during INTRAP and reti sequences, and 
allow access of Boot and I/O space before the Page 
Table Directories and Page Tables have been initialized 
during system booting. 






The hardwired TLB (HTLB), implemented in random 
logic and not visible to software, contains the functional 
equivalents of the VA, RA, ST and PL fields found in the 
writable TLB. However, equivalents to other TLB fields 
are not used in the HTLB. The HTLB can be accessed 
only during CPU supervisor mode, so UV and SV flags 
are not required. HTLB lines cannot be replaced, so no 
Used flags are required. Pages referenced by the HTLB 
are dedicated pages not subject to general replacement 
by the operating system, so no Referenced or Dirty 
flags are required. 


HTLB Mapping 

The hardwired TLB is invoked only when supervisor 
pages 0-7 are addressed by the CPU. With the excep- 
tion of CAMMU I/O space (the upper half of page 0 of 
No Access Allowed VO space), these pages can also be mapped through 
Read Only Allowed the writable TLB. Virtual pages other than supervisor 


Read and Write Allowed pages 0 - 7 must be mapped through the writable TLB. 
Read and Execute Allowed 
RWE Read, Write, and Execute Allowed 
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When a CAMMU detects a supervisor page 0 - 7 virtual 
address, it selects the Real Address, System Tag, and 
Protection Level fields from the appropriate HTLB line. 
Pages 0 - 3 and 6 - 7 are protection free, allowing read, 
write and execution accesses. Pages 4 and 5 allow only 
read and write access, and attempted instruction fetch 
from these pages results in a protection fault. The real 
address translation and system tag assigned to each 
page is shown in Table 11. 


The address space assigned to each of the virtual 
pages is also shown in Table 11. These address space 
assignments are determined by the System Tag, which 
is asserted on CLIPPER Bus lines TG<2:0> during 
CLIPPER Bus access. These lines function as main 
memory, I/O and Boot space selects and must be 
decoded by system hardware for proper device 
selection. 


Figure 22 contains a pictorial overview of Hardwired 
TLB mapping showing the three distinct real address 
spaces into which virtual pages 0 - 7 are mapped. 


Virtual Page 0 - 7 Assignments 
Three of the "hardwired" virtual pages are available for 
general use. The other five are dedicated for specific 


Table 11 Hardwired TLB Address Translations 











Virtual 
Page 
Number |Number 


Page 
Assignment 






General-Purpose and 
Interrupt and Trap Vectors 
General-Purpose 
General-Purpose 
General-Purpose 
VO, CAMMUS and Reserved 
ie) 
General-Purpose 

General-Purpose 


NI 
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purposes. These page assignments are shown in 
Table 11. 


The CPU fetches interrupt and trap vectors from super- 
visor virtual page 0. The CAMMUs translate this page 
into page 0 of main memory where the vectors must be 
located. 


Figure 22 Hardwired TLB Mapping 
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System Tag 


ST, TG Description 


shared, write-through, main memory space 

















private, copy-back, main memory space 
noncacheable, main memory space 
noncacheable, main memory space 
noncacheable, I/O space 
noncacheable, I/O space 
noncacheable, Boot space 
noncacheable, Boot space 









Note: The ST field is decoded by the CAMMU during page access. The bits are transferred to the CLIPPER Bus lines 
TG«<2:0> during CLIPPER Bus access. 
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Similarly, following CLIPPER reset, the CPU fetches ini- tion 7.6, Internal Registers). Virtual page § is available 
tial boot code from virtual page 6, which the CAMMUs for /O. The D-CAMMU translates these virtual pages 
translate into page 0 of Boot space. The CAMMUs also into pages O and 1 of real I/O space. Attempted access 
translate virtual page 7 into Boot space (page 1) to of virtual pages 4 or 5 for instruction fetch results in a 
allow a total of 8 K-bytes of HTLB-transiated Boot ad- protection fault. 
dresses. 

Additional pages can be created in Boot or I/O space by 
The first Boot instructions must be located at supervisor assigning the appropriate System Tag (5 or 4) to virtual 
virtual address 6000 Hex, which translates to address 0 pages other than 0 - 7. Translation of these pages, 
of Boot space. The rest of the boot code should be Io- however, is by the writable TLB or the DTU, not by the 
cated in pages 0 and 1 of Boot space as required. Hardwired TLB. 
Virtual page 4 is reserved by Intergraph for CAMMU in- Virtual pages 1- 3 are general-purpose pages which are 
ternal register addressing and for future use (see Sec- translated into main memory pages 1- 3 by the HTLB. 


Figure 23 DTU Virtual Address Translation 
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7.2.3. Dynamic Translation Unit (DTU) 

The DTU translates virtual address bits <31:12> into 
real address bits <31:12> in two steps. First it fetches a 
Page Table Origin from a Page Table Directory located 
in main memory; then it fetches an entry from a Page 
Table, also located in main memory. This sequence is 
depicted in Figure 23. 


Once the DTU has completed address translation, the 
CAMMU updates the TLB and provides the real address 
to validate the cache data, then searches the cache for 
the data. If the data is not cached, the CAMMU 
accesses main memory for the data using the con- 
catenation of translated address bits <31:12> and un- 
translated virtual address bits <11:0>. 


Note that because the DTU accesses only main 
memory and not the cache during address translation, 
all Page Table Directories and Page Tables must be lo- 
cated in noncacheable pages (see Section 7.4, 
System Tag). 


Page Table Directory Entry Selection 

Two 20-bit PDO (Page Directory Origin) registers each 
contain the base address of a Page Table Directory. 
One PDO register is used by the CAMMU during super- 
visor mode operations; the other PDO register is used 
during user mode operations. The DTU concatenates 
bits <31:22> of the virtual address with the contents of 
the appropriate PDO register to form the most sig- 
nificant 30 bits of a Page Table Directory entry address. 
(Page Table Directory entries are word-aligned; there- 


Figure 24 Page Table Format 
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fore bits <1:0> are forced LOW.) In effect, the PDO 
register points to the Page Table Directory, and bits 
<31:22> of the virtual address select one of 1024 Page 
Table Directory entries. 


Page Table Directory Format 

Each Page Table Directory consists of 1024 32-bit 
words located in main memory. Page Table Directory 
entries (see Figure 24) are comprised of two fields. 


PTO: Page Table Origin 
This field is used by the DTU during address translation 
to locate the Page Table in main memory. 


F: Page Fault 

The F flag is set or cleared by the operating system to 
indicate the absence or presence of a valid Page Table 
pointed to by the PTO field in the entry. A set F flag indi- 
cates absence of a valid Page Table, and attempted 
DTU address translation with a Page Table directory 
entry having a set F flag forces a CPU page fault trap. 


Page Table Entry Selection 

The selected Page Table Directory entry contains a 20- 
bit PTO field (see Figure 24) which holds the base ad- 
dress of a Page Table that is to be used for address 
translation. The DTU concatenates bits <21:12> of the 
virtual address with the contents of the PTO field to 
form bits <31:2> of the Page Table entry address (bits 
<1:0> are forced LOW). In effect, the PTO field points 
to the appropriate Page Table, and bits <21:12> of the 
virtual address select one of 1024 Page Table entries. 
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Figure 25 Page Table Entry Format 
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The CAMMU then uses the 20-bit RA (Real Address) 
field in the Page Table entry, shown in Figure 25, as 
bits <31:12> of the real address. 


Page Table Format 

Each Page Table consists of 1024 32-bit words com- 
prised of six fields, as shown in Figure 25. Equivalent 
ST, S, PL, D, and R fields are located in the TLB 
registers. See Section 7.2.1, Translation Lookaside 
Buffer. 


The following are the Page Table entry field descriptions: 


RA: Real Address 

The 20-bits of the RA field are used as real address bits 
<31:12> following address translation. These bits con- 
stitute a 4 K-byte page address. 


ST: System Tag 

This field identifies the caching policy, caching type, and 
address space of the page (see Section 7.4, System 
Tag). 


S: System Reserved 
This is a general-purpose 2-bit field reserved for the 
operating system. 


PL: Protection Level 

The CAMMU uses this field to determine whether data 
read, data write, and instruction fetching are allowed 
to/from the page (see Section 7.2.1, Translation 
Lookaside Buffer). 


Figure 27 CLIPPER Cache Line Format 
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D: Dirty Flag 
The Dirty flag is set by the CAMMU to indicate that the 
page has been altered. 


R: Referenced Flag 
The CAMMU sets the FR flag to indicate that the page 
has been accessed. 


F: Page Fault 

The F flag is set/cleared by the operating system to indi- 
cate the absence/presence of a valid page. A set F flag 
indicates absence of a valid page, and attempted DTU 
address translation with a Page Table entry having a set 
F flag forces a CPU page fault trap. 


7.3. Cache 

The cache architecture is similar to that of the TLB, as 
shown in Figure 26. It is a 4 K-byte cache composed of 
128 sets of lines, with each set consisting of a W com- 
partment line and an X compartment line. 


Associated with each cache line set is a U bit which is 
set to indicate that the W line of the entry was last ac- 
cessed, and cleared to indicate that the X line was last 
accessed. When, as a result of cache miss, a new data 
quadword has to be cached, the least recently used line 
in the selected line set is replaced based on the state of 
this bit. 


7.3.1. Cache Line Description 

Figure 27 shows the cache W and X compartment line 
format. Each line consists of four 32-bit data words and 
LV, LD, and RA fields defined as follows: 


32 32 32 
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LV: Line Valid 

The LV bit, when set, indicates that the data in the as- 
sociated line is valid; when clear, it indicates that the 
data is invalid. 


Aline LV bit is set by the CAMMU when it loads new 
data into the cache line. The bit is cleared by the 
CAMMU, operating as a slave, in response to CLIPPER 
Bus activity when its Bus Watch modes are enabled, or 
by a cache purge operation (TG = 6) as described in 
Section 7.4.2, System Tag 6. Individual LV flags are 
also cleared by hardware when the CV (Clear Valid) bit 
in the CAMMU Control Register is set, and the cache 
provides more current (dirty) data for a quadword /O 
Read (see Section 7.5, Bus Watch Modes, Watch I/O 
Reads). 


In the case of Bus Watch, LV is cleared during a quad- 
word write to a main memory address that matches the 
particular cache location. During the Bus Watch opera- 
tion, the CAMMU asserts the CBSY signal on the 
CLIPPER Bus. The CBSY signal prevents another bus 
transaction from beginning until Bus Watch operation 
has completed. If the CPU has addressed this same 
cache line prior to the CLIPPER Bus's write operation, 
the CPU has priority and the bit is not cleared until the 
CPU access is completed. This is described in more 
detail in Section 7.5, Bus Watch Modes. 





All cache LV flags can be cleared as a group by writing 
to the CAMMU Reset Register (see Section 7.6, Internal 
Registers), and by CLIPPER hardware reset. 


LD: Line Dirty 

The LD bit is set by the CAMMU to indicate that data in 
the cache line has not been updated in main memory. 
This occurs when the CAMMU is operating in the copy- 
back mode (see Section 7.4, System Tag), and a CPU 
write to memory results in a cache hit. In this case, the 
data is written to the cache but not to main memory, 
resulting in “dirty” cache data, i.e., data which is more 
current than main memory data. This bit is cleared by 
the CAMMU when the copy-back data is copied back to 
memory. 


RA: Real Address Tag 

The RA tag is used for cache line selection. The RA 
tags of both the W and X compartment lines are com- 
pared with translated address bits <31:12> and bit 11 of 
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the virtual address. Accessed data is located in a match- 
ing line. 


WO0-W3: Word 0 - 3 
Words 0 - 3 are the four 32-bit data words in the cache 


line. 


7.3.2. Cache Data Selection 

Virtual address bits <11:2> are used directly by the 
CAMMU as real address bits <11:2> to access cache 
data (refer to Figure 19). 


The CAMMU uses address bits <10:4> to select one of 
the 128 cache line sets. The CAMMU compares the con- 
catenation of translated address bits <31:12> and ad- 
dress bit 11 with the RA field of both the W line and the 
X line of the selected line set. If there is no match, a 
cache miss has occurred, and the CAMMU accesses 
main memory for the data transfer. If there is a match, 
the CAMMU uses address bits <3:2> to select one of 
the four data words in the matching line for the data 
transfer. 


7.3.3. Prefetch 

The D-CAMMU implements a “demand” data fetching al- 
gorithm. Data fetching for the cache is “on demand” by 
the CPU; that is, a new data quadword is fetched from 
main memory only as a result of a cache miss. 


The |-CAMMU also supports demand fetching which 
functions identically to D-CAMMU data fetching, but fea- 
tures an optional prefetching algorithm not available in 
the D-CAMMU. This algorithm prefetches the next four 
words of instructions from main memory for line N + 1 
of the cache when line N has been accessed by the 
CPU. |-CAMMU prefetching is controlled by bit 0 of the 
CAMMU Control Register. When bit 0 is clear, prefetch 
is disabled; when set, prefetch is enabled. 


I-CAMMU prefetch enable/disable should be based on 
the general structure of the code being executed. If the 
instructions are in general executed sequentially as 
stored in main memory, the probability of cache hits of 
prefetched instructions is high; therefore, prefetch 
should be enabled for increased CPU throughput. If the 
instructions are branch intensive, the probability of 
cache hits of prefetched instructions is reduced; there- 
fore, prefetch may be disabled to reduce system bus 
traffic. 
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7.3.4. Quadword Data Transfers 

The CAMMU cache lines each contain four words. The 
CAMMuUs transfer data/instructions between the caches 
and main memory four words (one quadword) at a time. 
(Single-word transfers are used for data/instructions in 
noncacheable pages.) 


7.4. System Tag 

Associated with each virtual page is a three-bit ST (Sys- 
tem Tag) field which determines the caching policy that 
applies to the page, the page caching type (private or 
shared), and the page's address space (I/O, Boot, or 
main memory). In addition, the System Tag can be used 
to identify two special operations: Cache Purge and 
Slave /O mode. This field is found in the Page Table 
entries (see Figure 25) and in the TLB (see Figure 21). 


The System Tags for supervisor pages 0 - 7 are 
hardware-selected by the CAMMU (see Section 7.2.2, 
Fixed Address Translation and Table 11). All other vir- 
tual address page System Tags are assigned by the 
operating system according to system requirements 
when it creates the Page Tables. The CAMMUs decode 
the ST fields during address translation and transfer the 
bits to the CLIPPER Bus lines TG<2:0> during 
CLIPPER Bus access. (If the system is being operated 
in unmapped mode, the UST field (Unmapped System 
Tag) in the CAMMU Control Register determines the 
System Tag.) 


7.4.1. System Tags 0-5 

System Tags 0 - 5 encode the following information 

about a virtual page: 

(1) Address space— main memory, Boot, or I/O. 

(2) Caching type—private or shared. 

(3) Caching policy—cacheable or noncacheable; write- 
through or copy-back. 


Address Space 

“Address space” identifies the real address space of the 
page as either main memory space, Boot space or I/O 
space. The CAMMUs map all virtual addresses into one 
of these spaces. 
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Caching Type 

Two types of page caching are recognized by the 
CAMMU: private and shared. Private caching pages are 
accessed and cached by one CAMMU only. Shared 
caching pages are accessed and cached by more than 
one CAMMU. (Pages that are cached by both the 
I-CAMMU and D-CAMMU of a CLIPPER Module are 
shared pages.) 


Note that the terms “shared” and “private” relate only to 
CAMMU access. In fact, a page may be private to a 
CAMMU but accessible by non-CAMMU devices. This 
page, though private, is commen to one or more 
devices other than the CAMMU, and should therefore 
receive special consideration when assigning System 
Tags. 


Caching Policy 

Caching policy is a set of attributes assigned to a page 
by the System Tag which identifies the page as cache- 
able or noncacheable, and, if cacheable, defines the 
caching mode which applies to the page as copy-back 
or write-through. Combinations of write-through or copy- 
back caching to private pages, and write-through cach- 
ing to shared pages are possible. 


Cacheable data can be entered into a cache; noncache- 
able data cannot be entered into a cache. Pages can 
be tagged as cacheable or noncacheable according to 
system requirements. 


Write-through and copy-back are two schemes for updat- 
ing main memory with data in the D-CAMMU cache. 
Selection of these modes is based on system require- 
ments and page caching type. Private pages may be 
write-through or copy-back; shared pages must be write- 
through. 


During a CPU write, the CAMMU searches the cache 
for the accessed location. If the location is not in the 
cache (a cache miss), the CAMMU operates according 
to the caching mode as follows: 


(1) Write-through—the CAMMU updates main memory 
with the CPU data but does not update the cache be- 
cause the data is not cached. 
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(2) Copy-back—the GCAMMU reads the quadword con- 
taining the addressed data from main memory into the 
cache, then updates the cache, but does not update 
main memory. 


If the data is in the cache (a cache hit), the CAMMU 
operates according to caching mode as follows: 


(1) Write-through—the CAMMU updates both the 
cache and the main memory. 


(2) Copy-back—the CAMMU updates only the cache, 
but does not update main memory. 


Write-through mode forces the D-CAMMU to access the 
CLIPPER Bus and update main memory immediately fol- 
lowing a cache write. Write-through mode thus ensures 
that main memory data is current with the cache at all 
times. Shared pages (those that can appear in more 
than one cache) must be write-through. 


Copy-back mode inhibits updating of main memory with 
the new data until the cache line written into is 
replaced. When a copy-back write hit to the cache oc- 
curs, the LD (Line Dirty) flag in the hit line is set to indi- 
cate that the line data must be written to main memory 
when the line is replaced. Since copy-back mode does 
not assure data consistency between main memory and 
the cache at all times, copy-back mode cannot be used 
for pages that are shared by another CAMMU. 


Write-through mode eases the task of data manage- 
ment because main memory is always “up to date” but 
increases CLIPPER Bus traffic because the CAMMU 
must access the bus following each cache write. Copy- 
back requires more careful data management considera- 
tion but decreases system bus traffic, thereby 
significantly improving system performance. These fac- 
tors should be considered when assigning System Tags. 


7.4.2. System Tag 6—Cache Purge 
System Tag 6, Cache Purge, forces invalidation (purg- 
ing) of cache lines that are hit during CPU write opera- 
tions that are tagged TG = 6. The lines are invalidated 
by clearing of the Line Valid (LV) bits. 


ACPU write with the ST field in the TLB set to a 6 will 
purge hit cache lines of its own caches. The write 
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proceeds as normal with TG lines = 6 on the CLIPPER 
Bus, causing hit lines in other module caches (having 
Watch CPU writes enabled) to be purged. Any cache 
with Watch CPU writes enabled will purge hit lines 
(regardless of the state of the TLB system tag field) 
when a write is detected on the CLIPPER Bus with 
TG =6. 


The Cache Purge feature facilitates the re-use of pages 
by allowing invalidation of data belonging to a replaced 
page which is left in a cache. In multiple CLIPPER 
Module applications, for example, a page might be 
replaced in main memory which may leave unpurged 
data in a cache of the module not initiating the page re- 
placement. The CAMMU initiating the page replacement 
can invalidate that cache data by writing to the cached 
data locations using the Cache Purge tag. 


7.4.3. System Tag 7—Slave /O 

System Tag 7, Slave I/O Mode, in effect allows the 
module to act as a DMA controller. It supports transfer 
of data between I/O and main memory in DMA-like 
fashion, but is not intended to replace DMA controllers. 


During Slave I/O operation, the CLIPPER Module 
accesses an individual word, halfword, or byte from a 
source (such as main memory) which is simultaneously 
read by a destination device. Both read and write opera- 
tions can be tagged Slave /O mode. 


Slave V/O read operations are used to transfer data from 
main memory to an I/O device. The CLIPPER Module 
executes a read from memory with TG = 7, and the 
memory responds with the data which is read by the /O 
but can be ignored by the CPU. The I/O must recognize 
TG =7 as Slave I/O mode and assert RDYi to terminate 
the operation. Timing for the Slave I/O read operation is 
the same as for a standard read. 


Slave I/O write operations are used to transfer data 
from I/O to main memory. The C300 executes a write to 
memory with TG = 7 using arbitrary data that will not be 
asserted on the bus. The I/O must recognize TG = 7 as 
slave I/O mode and assert data on the bus. The main 
memory asserts RDYi to terminate the operation. 


Timing for the Slave I/O write operation is the same as 
for a standard write, with the exception that DIR transi- 
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tions after the address phase of the operation as if for a 
read. This prevents the external bus transceivers from 
driving the bus during the data transfer phase of the 
operation, allowing the I/O to send data without bus con- 
tention. 


7.5. Bus Watch Modes 

Bus Watch modes are used by the CAMMU to ensure 
data consistency between the cache and main memory, 
and to transfer the “latest data to an external device 
reading main memory. 


When Bus Watch is enabled in a CAMMU, it monitors 
main memory accesses by other bus masters. Depend- 
ing on the Bus Watch mode enabled and the type of 
main memory access (identified by decoding the 
CLIPPER Bus TG<2:0> and CT<5:0> lines, as 
described in Section 9, CLIPPER Bus), the CAMMU in- 
tervenes to update the cache with data that is written to 
main memory, to invalidate cache data, or to transfer up- 
dated data from the cache to a bus master that is read- 
ing main memory. This Bus Watch monitoring occurs in 
parallel with the memory access in order to eliminate or 
minimize the Bus Watch operation effect on CLIPPER 
Bus throughput. Each CLIPPER Bus master must 
generate the appropriate TG<2:0> tag when accessing 
the CLIPPER Bus. 


The three Bus Watch modes featured by the CAMMU 

are: 

(1) Watch CPU Writes (CPU writes to shared cacheable 
pages) 

(2) Watch I/O Writes (I/O writes to cacheable pages) 

(3) Watch I/O Reads (I/O reads from private copy-back 
pages) 


These Bus Watch modes are controlled by bits <3:1> of 
the CAMMU Control Register, as explained in Section 
7.6, Internal Registers. 


Watch CPU Writes 

Watch CPU Writes is enabled in a CAMMU to ensure 
that data in the CAMMU cache is updated with new 
data written by another CAMMU into its shared main 
memory pages, or to invalidate cache lines (cache data) 
in the case of quadword writes. 


The CPU transfers data to/from main memory via the 
D-CAMMU. The D-CAMMU transfers the data using 
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either single-word (byte, halfword, or word) transfers, or 
quadword transfers. When, with Watch CPU Writes 
enabled, a slave CAMMU (a CAMMU that is not access- 
ing the bus) detects a CPU write to one of its shared 
main memory pages by a master CAMMU (a D- 
CAMMU that is accessing the bus), the slave CAMMU 
determines whether the accessed location is in its 
cache, and whether the write involves one word for four 
words. If a single word write, the CAMMU updates the 
matched cache line. If a quadword write, the CAMMU 
does not update the matched cache line but instead in- 
validates the line by clearing the LV bit. 


Watch VO Writes 

Watch 1/O Writes, when invoked, functions identically to 
Watch CPU Writes. The two modes differ in that Watch 
1/O Writes responds to I/O writes rather than to CPU 
writes. 


When a CAMMU with Watch I/O Writes enabied detects 
an I/O write to one of its cacheable pages, the CAMMU 
determines whether the accessed data is cached in the 
CAMMU, and whether the write involves one word or 
four words. If a single-word write, the CAMMU updates 
the matched cache line. If a quadword write, the 
CAMMU does not update the matched line but instead 
invalidates the line by clearing the LV bit. 


Watch VO Reads 

Watch I/O Reads is enabled to ensure that data read by 
V/O devices from private, copy-back pages is always cur- 
rent data. This mode functions identically for both single- 
word and quadword I/O reads. 


When this mode is enabled in a D-CAMMU, the D- 
CAMMU monitors the system bus for reads by I/O of 
private, copy-back pages. When the D-CAMMU detects 
such a read, it searches its cache for the data. If the 
data is not cached or the cached data LD bit is clear, 
the I/O device reads the data directly from main 
memory. If the data is cached and the LD bit is set, the 
O-CAMMU aborts the assertion of data by main 
memory and asserts the current cache data on the 
CLIPPER Bus. The D-CAMMU thus intervenes in the 
I/O read operation to provide the more current cached 
data. If the Clear Valid bit in the Control Register is set, 
then the CAMMU will also clear the Line Valid bit in the 
cache line used to supply the data; the memory inter- 
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face can use the cache data to update its own contents, 
as described in Section 9.4.7. 


Note that because the copy-back caching mode applies 
only to private pages not shared by CAMMUs, this Bus 
Watch mode is invoked only during I/O reads of copy- 
back pages. 


7.6. Internal Registers 

Each CAMMU contains five software-accessible 
registers used for initialization and contro!. Two of these 
registers, the Supervisor PDO and User PDO, are used 
in address translation; they contain the base addresses 
of the supervisor and user Page Table Directories. The 
Fault register is loaded with the virtual address as- 
sociated with certain fault conditions and is used by the 
operating system to implement virtual memory. The Con- 
trol and Reset registers are used to control various 
aspects of CAMMU operation. These registers are dis- 
cussed in the following sections. 


7.6.1. Supervisor PDO Register 

The Supervisor PDO (Page Directory Origin) Register is 
a 20-bit read/write register that holds the base address 
of a Page Table Directory address which is used by the 
DTU during supervisor mode address translation (see 
Section 7.2.3, Dynamic Translation Unit). 


7.6.2. User PDO Register 

The User PDO (Page Directory Origin) Register is a 20- 
bit read/write register that holds the base address of a 
Page Table Directory address which is used by the DTU 
during user mode address translation (see Section 
7.2.3, Dynamic Translation Unit. 


Figure 28 CAMMU Control Register 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 


7.6.3. Fault Register 

The Fault Register is a 32-bit read/write register which 
holds the virtual address of the data or instruction 
memory location that generated a page fault., a protec- 
tion fault, or an alignment fault. This register can be 
read by trap handler software to determine the faulting 
virtual address, and written to by the operating system 
to restore the context of a suspended faulting process. 


7.6.4. Control Register 

The Control Register is a 32-bit read/write register used 
to enable prefetching in the |-CAMMU, to selectively 
enable the Bus Watch modes, to assign a system tag to 
unmapped memory addresses, and to enable the clear- 
ing of cache line LV bits during Bus Watch of 1/O 

Reads. It is also used to enable the data or instruction 
alignment trap selectively in either CAMMU, and to indi- 
cate the CAMMU's vresion number.The Control Register 
is shown in Figure 28 and is described below. 


CID: CAMMU ID 

The 8-bit CID field contains the CAMMU's ID version 
number. This field is read only; writes to this field are ig- 
nored. 


ATE: Alignment Trap Enable 

The alignment tap enable flag is set by software to 
enable a trap when an unaligned data or instruction ac- 
cess is attempted. It is cleared by software to disable 
the alignment trap. See Sections 6.2.1 and 6.2.4 
describing data and instruction memory traps. On reset, 
this bit is set by hardware. 


31 24 23 109 8 7 6 5 43 2 1 0 
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NOTE: BITS <6:7> AND <23:10> MUST ALWAYS BE PROGRAMMED 0 OR UNDEFINED RESULTS WILL OCCUR. 
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CV: Clear Valid 

When this bit is set, the LV (Line Valid) bit in a copy- 
back cache line is cleared by hardware when the more 
current (dirty) data contained within that line is supplied 
by the CAMMU for an I/O quadword read (as a result of 
Bus Watch of /O Reads). This permits pages that are 
swapped by I/O back to disk to be simultaneously 
purged from the cache. Use of this option requires the 
memory interface to use the data sent to the I/O device 
to update its own contents, except in cases where the 
data will not be read by another I/O device (see Section 
9.4.7). The Clear Valid option is disabled by clearing 
this bit. On reset, this bit is cleared by hardware. 


Note: The Clear Valid bit was called the “Clear Dirty" bit 
in previous documents. 


UST: Unmapped System Tag 

When the Mapped Mode bit in the CPU System Status 
Word is clear, all CPU addresses, except supervisor vir- 
tual addresses 0 - 7FFF Hex which are mapped by the 
HTLB, are treated by the CAMMUs as real addresses 
requiring no translation. These unmapped addresses 
therefore have no TLB or Page Table source of system 
tags. The CAMMUs, therefore, use the two-bit UST field 
to tag pages referenced with unmapped addresses as 
follows: 


UST Description 
0 private, write-through, main memory space 
1 shared, write-through, main memory space 
2 private, copy-back, main memory space 
3 noncacheable, main memory space 


Figure 29 CAMMU Reset Register 
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The UST bits map to TG<1:0> CLIPPER Bus lines. 
TG<2> is forced to 0. UST is set to 3 by a reset. 


EWIR: Enable Watch I/O Reads 

EWIR, when set, enables Watch I/O Reads operation. 
This bit is ignored by the I-CAMMU. EWIR is set by 
CLIPPER Module reset. 


EWIW: Enable Watch 1/O Writes 
EWIW, when set, enables Watch I/O Writes operation. 
EWIW is set by CLIPPER Module reset. 


EWCW: Enable Watch CPU Writes 
EWCW, when set, enables Watch CPU Writes opera- 
tion. EWCW is set by CLIPPER Module reset. 


EP: Enable Prefetch 

EP, when set, enables |-CAMMU prefetching. When EP 
is clear, |-CAMMU prefetching is disabled, and the 
I-CAMMU fetches instructions "on demand.” The state 
of this bit is ignored by the D-CAMMU, which always 
fetches on demand. EP is set by CLIPPER Module 
reset. 


7.6.5. Reset Register 

The Reset Register is a 7-bit, write-only register that al- 
lows selective resetting of the CAMMU cache and TLB 
(see Figure 29). The cache LV (Line Valid) and U 
(Used) flags, and the TLB SV (Supervisor Valid), UV 
(User Valid), D (Dirty) and R (Referenced) flags can be 
cleared by setting individual Reset Register bits. 


6 5 4 3 2 1 0 
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NOTE: BITS <31:7> MUST ALWAYS BE PROGRAMMED 0 OR UNDEFINED RESULTS WILL OCCUR. 
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Figure 30 CAMMU Access Map 


VIRTUAL ADDRESS 


RESERVED Ox00004E00 — Ox00004F FF 


0x00004D81 — ox00004DFF 
0x00004D80 
0x00004041 — 0x00004D7F 
0x00004D40 
0x00004D11 — 0x00004D3F 
0x00004D10 
0x00004D09 — 0x00004D0F 
0x000040D08 
0x00004D05 — 0x00004D07 
oxo0004D04 
0x00004D00 — 0x00004D03 
Ox00004CFF 
Ox00004CFE 
0x00004CFD 
Ox00004CFC 
0x00004C04 — 0x00004CFB 
0x00004C03 
ox00004C02 
0x00004C01 
0x00004C00 


GLOBAL 
CAMMU 


0x00004881 — Ox00004BFF 
0x00004B60 
0x00004B41 — 0x00004B7F 
0x00004B40 
0x00004811 — 0x0000483F 
0x00004810 
0x00004B09 — 0x00004B0F 
0x00004B08 
0x00004B05 — 0x00004B07 
0x00004B04 
0x00004B00 — 0x00004503 
OxQ0004AFF 
Ox00004AFE 
0x00004AFD 
Ox00004AFC 
0x00004A04 — Ox00004AFB 
0x00004A03 
0x00004A02 
0x00004A01 
0x00004A00 


1-CAMMU 


0x00004981 — Ox000069FF 
ox00004980 
0x00004941 — 0x0000497F 
0x00004940 
0x00004911 — 0x0000493F 
0x00004910 
0x00004809 — Ox0000490F 
0x00004908 
0x00004905 — 0x00004907 
0x00004904 
0x00004900 — 0x00004903 
0x000046FF 
Ox000048FE 
0x000048FD 
0x000048FC 
0x00004804 — 6x000048FB 
0x00004803 
0x00004802 
0x00004801 
0x00004800 


0-CAMMU 


DESCRIPTION 
Reserved. 


Reserved. 

Global CAMMU, Reset register. 

Reserved. 

Global CAMMY, Control register. 

Reserved. 

Global CAMMU, Fault regiater. 

Reserved. 

Global CAMMU, User Page Directory Offset register. 
Reserved. 

Global GAMMU, Supervisor Page Directory Offset register. 
Reserved. 

Global CAMMU, TLB Line Set 63, X Line, VA Field. 
Global CAMMU, TLB Line Set 63, X Line, RA Field. 
Global CAMMU, TLB Line Set 63, W Line, VA Field. 
Global CAMMU, TLB Line Set 63, W Line, RA Field. 
Global CAMMU, Line Sets 1 through 62. 

Global CAMMU, TLB Line Set 0, X Line, VA Field. 
Globai CAMMU, TLB Line Set 0, X Line, RA Field. 
Global CAMMU, TLB Line Set 0, W Line, VA Field. 
Global CAMMU, TLB Line Set 0, W Line, RA Fieid. 


Reserved. 

-CAMMU, Reset register. 

Reserved. 

+CAMMU, Control register. 

Reserved. 

-LCAMMU, Fault register. 

Reserved. 

+CAMMU, User Page Directory Offset register. 
Reserved. 

I-CAMMU, Supervisor Page Directory Offset register. 
Reserved. 

-LCAMMU, TLB Line Set 63, X Line, VA Field. 
-CAMMU, TLB Line Set 63, X Line, RA Fleid. 
1-CAMMU, TLB Line Set 63, W Line, VA Field. 
|-CAMMU, TLB Line Set 63, W Line, RA Field. 
I-CAMMU, Line Sets 1 through 62. 

-LCAMMU, TLB Line Set 0, X Line, VA Field. 
L-CAMMU, TLB Line Set 0, X Line, RA Field. 
I-CAMMU, TLB Line Set 0, W Line, VA Field. 
-GAMMU, TLB Line Set 0, W Line, RA Field. 


Reserved. 

D-CAMMU, Reset register. 

Reserved. 

D-CAMMU, Control register. 

Reserved. 

D-CAMMU, Fault register. 

Reserved. 

O-CAMMU, User Page Directory Offset register. 
Reserved. 

D-CAMMU, Supervisor Page Directory Offset register. 
Reserved. 

D-CAMMU, TLB Line Set 63, X Line, VA Field. 
D-CAMMU, TLB Line Set 63, X Line, RA Fieid, 
D-CAMMU, TLB Line Set 63, W Line, VA Field. 
D-CAMMU, TLB Line Set 63, W Line, RA Field. 
D-CAMMU, Line Sets 1 through 62. 
D-CAMMU, TLB Line Set 0, X Line, VA Field. 
D-CAMMU, TLB Line Set 0, X Line, RA Field. 
D-CAMMU, TLB Line Set 0, W Line, VA Field. 
D-CAMMU, TLB Line Set 0, W Line, RA Field. 
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The Reset Register bits and their associated reset 
operations are as follows: 


Bit# Bit Name Reset Operation 
RU Reset All U Flags in Cache 
RR Reset All R Flags in TLB 
RD Reset All D Flags in TLB 
Reset All UV Flags in TLB 
RSV Reset All SV Flags in TLB 
RLVX Reset All *X" Line LV Flags in Cache 
RLVW Reset All “W" Line LV Flags in Cache 


O-ANWAAOD 
D 
Cc 
< 


The reset operations shown are performed by writing to 
the Reset Register with the appropriate data pattern. 


These CAMMU registers, as well as the CAMMU TLBs, 
are located in virtual page 4, which is translated by the 
Hardwired TLB into Page 0 of I/O space. A map of 
CAMMU I/O space is shown in Figure 30. 


The CPU accesses the D-CAMMU I/O space directly. 
The CPU accesses the I-CAMMU I/O space indirectly 
via the D-CAMMU, because the I-CAMMU/CPU Instruc- 
tion Bus is tied to CPU instruction buffers which only 
transfer instructions. 


7.6.6. CAMMU Register Access 


The D-CAMMU registers are located in virtual address 
4800-49FF (Hex). The I-CAMMU registers are located 


Figure 31 CAMMU Addressing 


1615 121110 


VA <31:0> 


CAMMU SELECT 


0 = D-CAMMU TLB 
1 = D-CAMMU REG 
2 = I-CAMMU TLB 

3 = |-CAMMU REG 
4 GLOBAL TLS 

5 = GLOBAL REG 

6 = RESERVED 

7 = RESERVED 


in virtual address 4A00-4BFF. These addresses are 
used to access registers in individual CAMMUs. 


The CAMMUs can also be addressed as a group using 
global addresses for TLB writes, register writes, and 
TLBy/cache reset. In systems utilizing multiple CLIPPER 
Modules, for example, a CPU can execute global writes 
to CAMMUs other than its companion D-CAMMU by ac- 
cessing virtual address locations 4Cnn (Hex, TLB write), 
and 4Dnn (Hex, register write, and TLB/cache reset). 
V/O devices can execute the global writes by accessing 
Cnn and Dann (Hex). 


Global writes are typically used in multi-CPU systems 
when main memory pages that are shared by more 
than one CLIPPER Module are replaced. If the virtual 
address of a page being replaced is identical for all 
modules sharing the page, a single global write to 
CAMMU I/O space can be used to invalidate the TLB 
entry corresponding to the outgoing page in all 
CAMMUs. 


Register Addressing 

CAMMU I/O space addresses are shown in Figure 31. 
Virtual address bits <31:11> comprise the CAMMU 
Base Address field, which must point to the upper half 
of virtual page 4 for CAMMU access. 


Virtual address bits <10:8> comprise the CAMMU 
Select field. This field identifies the following: 


TLB LINE SET SELECT 
X/W LINE SELECT IF 


(1 = X SELECT) TLB 
ACCESS 
VA/RA SELECT 
[= VASELECT) 





REGISTER SELECT 


04 = SUPERVISOR PDO REGISTER 
08 = USER POO REGISTER 


10 = FAULT REGISTER IF 
20 = RESERVED REGISTER 
40 « CONTROL REGISTER ACCESS 


80 = RESET REGISTER 
Ace 
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Bit No. CAMMU 
10 9 8 Selected Operation/Access 


0 0 0 D-CAMMU RW TLB 

0 0 1 D-CAMMU R/W Registers; Reset 
TLB/Cache 

0 1 0 I-CAMMU) R/WTLB 

0 1 1° %I-CAMMU RW Registers; Reset 


TLB/Cache 

1 0 0 Global Write TLB 

1 0 1. Global Write Registers; Reset 
TLB/Cache 


Note: 
The TLBs and caches are reset by writing to the Reset 
Register. 


The first four entries show individual !- and D-CAMMU 
addressing. The last two entries show global address- 


Figure 32 TLB Access Data Formats 


(a) TLB RA FIELD ACCESS FORMAT 


31 12~=«11 98 76 








RESERVED 


REAL SYSTEM 
ADDRESS TAG 


(b) TLB VA FIELD ACCESS FORMAT 
31 18 (17 


i 


VIRTUAL RESERVED 


ADDRESS 
TAG 


A061 


3 


2. (5 


ing, intended for use in systems utilizing more than one 
CLIPPER Module. A CPU uses global addressing in 
such a system to access a specific register or to reset 
the TLB and cache in all CAMMUs in the system other 
than its own D-CAMMU. 


Bits <7:0> of the virtual address comprise the Register 
Select field. This field identifies the register or the TLB 
field being accessed. All five CAMMU registers, and in- 
dividual VA (Virtual Address) and RA (Real Address) 
fields of the TLB can be addressed. 


if the operation is a TLB access, virtual address bits 
<7:2> address one of the 64 TLB entries, bit <1> ad- 
dresses the W or X line of the TLB entry, and bit <O> ad- 
dresses the VA or RA field of the addressed TLB line. 


If the operation is a register access, virtual address bits 
<7:0> address the registers as follows: 


2 1 0 


USED FLAG 
REFERENCED FLAG 


DIRTY FLAG 


PROTECTION 


C) 


—_ RESERVED 


USER VALID FLAG 


SUPERVISOR VALID FLAG 
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Bits Register 

<7:0> Addressed 
0000 0000 Reserved, Must Be Zero 
0000 0001 Reserved, Must Be Zero 
0000 0010 Reserved, Must Be Zero 
0000 0100 Supervisor PDO (read/write) 
0000 1000 User PDO (read/write) 
0001 0000 Fault (read/write) 
0010 0000 Reserved, Must Be Zero 
0100 0000 Control (read/write) 
1000 0000 Reset (write only) 


CAMMU Data Format 

The format of data written to and read from the 
CAMMuUs varies according to the register or TLB field 
addressed. Both the fields and the number of data bits 
used in the 32-bit data words written to the CAMMUs dif- 
fer to accommodate individual CAMMU registers and 
register types. 


TLB Access Data Format 

TLB aceess data formats are shown in Figure 32. Two 
formats are used. One format is used when accessing a 
TLB RA field; the second is used when accessing a 
TLB VA field. 


Figure 32A shows the data format used when accessing 
a TLB RA field. When accessing an RA field, the 
System Tag and Protection Level fields and the R and 


Figure 33. PDO Register Access Format 


31 


D flags of the addressed TLB line are also accessed, as 
well as the U flag of the TLB set containing the TLB line. 


Figure 32B shows the data format used when accessing 
a TLB VA field. The UV and SV flags of the TLB line are 
also accessed. Note that several data bits are not used. 
These bits are reserved by Intergraph and must be zero. 


PDO Register Access Data Format 

Figure 33 shows the data format used when accessing 
either the supervisor or the user PDO register. Bits 
<31:12> are used to transfer the 20-bit PDO data; bits 
<11:0> are reserved by Intergraph and must be zero. 


Fault Register Data Format 
The 32-bit address in the Fault Register is accessed as 
a 32-bit data word. 


Control Register Access Data Format 

The Control! Register is accessed as a 32-bit data word. 
Bits <23:10> are reserved by Intergraph and must be 
zero, 


Reset Register Access Data Format 

The seven least-significant bits are used when access- 
ing the Reset Register; bits <31:7> are reserved by 
Intergraph and must be zero. 


12 11 0 


PAGE DIRECTORY 
ORIGIN 


NOTE: THIS FORMAT IS USED FOR BOTH THE USER AND SUPERVISOR 


PDO REGISTER ACCESS. 
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8. CLIPPER Hardware Reset 

The CLIPPER C300 is reset when power is initially ap- 
plied to the module (power-on reset), and when RESET 
is asserted LOW during operation. 


The response of the CPU to a hardware reset is as 
follows: 


(1) The T flag in the PSW is cleared _and the BIG flag 
is set to indicate the state of the BIG pin. The 
remaining flags in the PSW are undefined. 

(2) The following SSW flags are cleared: El, TP, 

M, U, K, KU, UU, and P. The remaining 
flags are undefined, except for the ID field which is 
hardwired. 


The response of each CAMMU to reset is as follows: 


Figure 34 CLIPPER C300 Following Reset 
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(1) All LV (Line Valid) flags in the cache are cleared, 

(2) All U (Used) flags in the cache are cleared. 

(3) All UV (User Valid) flags in the TLB are cleared. 

(4) All SV (Supervisor Valid) flags in the TLB are 
cleared. 

(5) All D (Dirty) flags in the TLB are cleared. 

(6) All R (Referenced) flags in the TLB are cleared. 

(7) Bits <9:0> of the Control Register are set to 0x23F. 

(8) The Reset Register is cleared. 


Reset therefore places the CLIPPER C300 in un- 
mapped supervisor mode with all traps and conditional 
interrupts disabled and with Bus Watch and prefetching 
enabled. Figure 34 shows the state of the C300's CPU 
control registers, and the CAMMU's registers, TLB, and 
cache lines following reset. While RESET is asserted, 
all CLIPPER G300 Bus active LOW signals are pulled 
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HIGH (via pull-up resistors), and all active HIGH signals 
are forced LOW. BCLK continues clocking normally. 


RESET must be held tow for a minimum of 100 BCLK 
cycles after Von reaches Vop min when power is initially 
applied to the C300 (see Figure 35). This ensures ade- 
quate module reset time. It must be released in 
synchronization with BCLK. RESET must be held LOW 
for a minimum of 100 BCLK cycles when asserted 
during operation, and both the assertion and release of 
RESET must be synchronized with BCLK. 


The CLIPPER C300 executes diagnostic routines follow- 
ing release of RESET if URDIAG is asserted during the 
two BCLK cycles following the release of RESET (see 
Section 9.4.9, Diagnostics Control). Then it begins ex- 
ecution at supervisor virtual address 6000H, which is 
mapped by the HTLB to real address O of Boot space. 


9. CLIPPER Bus 


The CLIPPER C300 interfaces to external system 
devices and functional units such as main memory, I/O 


Figure 35 Reset Timing 


devices and peripherals, and other CLIPPER Modules 
via the CLIPPER Bus—a high-speed, synchronous bus 
designed to support multiple bus masters. 


The bus includes 32 bidirectional, multiplexed ad- 
dress/data lines which support byte, halfword, word, and 
quadword transfers. A separate interrupt bus allows fast 
interrupt management by the C300 with no ad- 
dress/data line loading or contention, thereby increasing 
the effective bus bandwidth. The bus protocol allows 
devices that are clocked at different rates to interface to 
the CLIPPER Module through the use of wait states as 
required, and bus arbitration to be centralized in a 
simple, fast bus arbiter. The bus supports Bus Watch, 
which monitors the bus and takes corrective action to 
ensure data consistency between the CAMMUs and 
main memory. 


The bus utilizes a single clock (BCLK), generated by 
the Clock Control Unit, for system clocking. All C300 
signal sampling and signal assertion are gated on the 
rising edge of this clock. A second clock signal, 2BCLK, 


‘bp 
|«—_—_——— 100 BCLKS MIN ————_ > | 





BUS @® 


| @ 
® 


URDIAG 


Notes: 


. CLIPPER Bus is inactive until first instruction fetch. 
. Internal CPU startup time. 


Aca4 


. RESET transitions must be synchronized with BCLK rising edges. 


CPU diagnostics execution if URDIAG was asserted during the 2 BCLK cycles following release of RESET. 
. Fetch from boot space. 


. URDIAG is asserted during RESET if CPU diagnostics execution prior to instruction execution is desired. URDIAG 
must remain asserted for at least 2 BCLK periods following release of RESET to assure recognition and can 
then either remain asserted or be released with no further effect on CLIPPER Module operation. 


ons oN = 
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with a frequency twice that of BCLK, provides additional 
clock edges for use in interfacing peripheral systems. 


All module outputs except BCLK are open drain and are 
tied to pullup resistors inside the module. These signals 
are tied to a 96-pin connector for interfacing to user- 
designed systems, where they may be buffered. 


The signals tied to the C300 connector constitute the 
CLIPPER Bus, shown in Figure 36. These signals are in- 


terfaced through buffers and logic devices as shown in 
Figure 37. Note that this interface includes ORed logic 


Figure 36 CLIPPER Bus Signals 


+— bia 





























———_ CBSYd ——® 0-CAMMU CACHE BUSY 
—— cBSvi —_———_» 1-CcAMMU CACHE BUSY 
—— Lock —____-» Bus Lock 

<— TR -————_» _ TRANSFER REQ (BUS ACTIVE CYCLE) 
ADYo!l ——__—_» READY OUTPUT I-CAMMU 
RDYo ———_» READY OUTPUT D-CAMMU 
<— ApYI 


6Ad ————_® BUS REQ D-CAMMU 


BRI ——-—__» BUS REQ I-CAMMU 








~t— MSBE/RETRY— SINGLE BIT ERROA/RETRY 























CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 


and address/data signal transceiver 
control (DIR). 


The CLIPPER C300 Bus consists of the following 
groups of bus lines and signals: 


— Address/Data multiplexed lines used for address and 
data transfer 

— Cycle Type signals used to identify the number of 
bytes or words transferred during a bus operation, to 
identify the operation as a read, write, or global 
write, and to identify the bus master executing the 
operation as a CPU or an I/O device 


a ~~ AD *32 ———-® AODRESS/DATA 


BIG ENDIAN 
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Figure 37 CLIPPER C300 Bus Interface 
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— Memory Space System Tags used to identify — Interrupt vector bus 
address space being accessed and the caching — Bus protocol control lines 
policy which applies to the accessed data 
— Error signals used to report memory errors and bus An example CLIPPER C300 system implementation 
errors showing these signals is depicted in Figure 38. Table 12 
— Bus arbitration handshaking signals contains detailed descriptions of the CLIPPER Bus 


— Interrupt control lines 


Figure 38 Example CLIPPER System (Block Diagram) 
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Table 12 CLIPPER Bus Signal Descriptions 


Signal Type Description 

AD Vo ADDRESS/DATA. This is a positive logic (HIGH = logic 1) multiplexed address and data 
<31:0> bus which is tied to the CAMMUs. 

BIG I BIG ENDIAN. The state of this pin selects between the big endian and little endian byte or- 


dering. When BIG is tied to GND, big endian ordering is selected. When this pin is con- 
nected to Vcc, little endian ordering is selected. BIG must be configured before RESET is 
removed and must not be changed following the release of RESET. 


R oO DIRECTION CONTROL. This control signal can be used to control the drive direction of __ 
TTL tranceivers interfacing AD <31:0> to the CLIPPER Bus. A master CAMMU asserts DIR 
during an entire write operation and during the first two cycles of a read operation. A slave 
D-CAMMU asserts this signal when transferring data during an I/O read; a slave |-CAMMU 
asserts this signal when transferring data to a companion D-CAMMU. Drive direction is 
from CAMMU to the CLIPPER Bus when DIR is low. 


Q 


TG Vo MEMORY SPACE SYSTEM TAGS. These three CAMMU signals identify the address 
<2:0> space being accessed, the page type, and the caching policy which applies to the ac- 
cessed page. In addition, they signal two special operations, Cache Purge and Slave 1/O 
mode. System tags are derived from one of four sources. In mapped mode, they are 
selected during address translation from the TLB, the HTLB, or from page tables in main 
memory. In unmapped mode, TG<2> is zero and TG<1:0> is selected by the UST bits in 
the CAMMU Control Register. TG<2:0> tag encoding is as follows: 


TG2 G1 TGO Encoding 

main memory/private cacheable/ write-through 
main memory/shared cacheable/write-through 
main memory/private cacheable/copy-back 
main memory/noncacheable 

VO space/noncacheable 

boot space/noncacheable 

cache purge 

slave 1/0 mode/main memory/noncacheable 


44440000 
32-00-00 
=~O-0-0-0 
nun an anoe 


Note: The slave CAMMU can continually monitor the Memory Space System Tag and 
check for cache/main memory data consistency when the Bus Watch modes are enabled. 
The Bus Watch modes, when enabled, are invoked during the following CLIPPER Bus 
operations: 


(1) I/O writes to shared or private pages. 
(2) CPU writes to shared pages. 
(3) I/O reads from private copy-back pages. 


Frequently systems require the transfer of data between main memory and an I/O device. 
This type of data transfer is normally implemented by a CPU as a read operation into a 


CPU register, followed by a CPU write operation to the I/O device. This type of operation is 
accelerated by the CLIPPER Module through the use of slave I/O mode identified by the 
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Table 12 CLIPPER Bus Signal Descriptions (cont.) 





Signal Type Description 





Memory Space System Tag. The slave I/O mode allows an I/O device to capture data being 
read by the CPU during the read portion of the operation. The I/O device must be able to in- 
terpret TG = 7 as slave 1/O mode, then read the transferred data as it is being read by the 
CPU. The data read by the CPU is discarded. 





VO CYCLE TYPE. These six CAMMU signals indicate the type of CLIPPER Bus operation in 








CT 
<5:0> progress. CT<5:0> signal encoding is as follows: 
Signal State Operation 
CTS 0 CPU master. 
1 VO master. 
CT4 0 Write operation. 
1 Read operation. 
CT<3:2> 00 Word/halfword/byte transfer. 
01 Quadword transfer. 
10 Reserved. 
1 Global CAMMU write. 
CT<1:0> 00 Whole word transfer. If AD<1:0> are not equal to 0 and the access is 
not of CAMMU I/O space, an alignment trap will be signalled. 
01 Reserved. 
10 Byte transfer; AD<1:0> define the byte position. 
1 Halfword transfer; AD<1> defines the halfword position; if AD<0> 
is not equal to 0, an alignment trap will be signalled. 
Notes: 
(a) CT<1:0> have meaning only for single word transfers, with AD<1:0> pointing to 
transferred word/bytes. 
(b) In halfword/byte mode, the data must be written in the location specified by AD<1:0>. 
(c) During a quadword transfer, the master must assert AD<3:0> all 0 to point toa 
quadword boundary. 
CBSYVi, fe) CACHE BUSY (I-CAMMU, D-CAMMU). A CAMMU asserts CBSY to indicate execution of in- 
CBSYd ternal operations associated with Bus Watch operations. Main memory data must not be as- 
serted on the CLIPPER Bus while CBSYi or CBSYd is asserted. If a D-CAMMU asserts 
RDYo while asserting CBSY, indicating that it is asserting more recent cache data on the 
CLIPPER Bus, main memory must abort the data transfer operation. 
LOCK Oo BUS LOCK. LOCK is asserted by a bus master when it requires the CLIPPER Bus for more 





than one operation. LOCK is asserted by the CAMMUs during the following operations: 


(1) DTU Page Table Directory and Page Table accesses. 

(2) DTU R or D bit modifications in Page Tables. 

(3) Read-modify-write (test and set) operations. 

(4) Cache line replace and fetch on cache miss (quadword write followed by 
quadword read). 
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Table 12 CLIPPER Bus Signal Descriptions (cont.) 


Signal Type Description 


TR Vo TRANSFER REQUEST. TA is asserted by a bus master to indicate that a CLIPPER Bus 
operation is in progress. 


RDYi I READY INPUT. RDYi is tied to both CAMMUs. During read operations, the slave with the ad- 
dressed data asserts RDYi to indicate that it has asserted the data on the AD bus. During 
single word, byte or halfword write operations, the slave asserts RDYi to indicate that it has 
latched (read) the data. During quadword write operations, the slave asserts RDYi to indi- 
cate that it has latched the data word currently on the AD bus, and is ready to latch the next 
word. 


RDYo oO READY OUTPUT. RDYo is asserted by the D-CAMMU during Watch 1/O Reads operations 
to indicate to the I/O device that it has asserted data on the AD bus for reading. This occurs 
when the data location being accessed in main memory is cached, and the cache data is 
more “recent” than the main memory data. RDYo can be tied to RDY on the CLIPPER Bus. 


RDYoi fe) READY OUTPUT. RDYoi is asserted by the I-CAMMU when it is being accessed by its com- 
panion D-CAMMU. RDYoi is not interfaced to the CLIPPER Bus. 
nS 
BRi,BRd re) BUS REQUEST (I-CAMMU, D-CAMMU). These signals are asserted by the respective 
CAMMUs to obtain control of the CLIPPER Bus. 


BGi.BGd I BUS GRANT (Il-CAMMU, D-CAMMU). Bus Grant is asserted by the CLIPPER Bus arbitra- 
tion logic in response to a Bus Request by a CAMMU, and indicates that the requesting 
CAMMU has control of the bus. 





MSBE/ I MEMORY SINGLE BIT ERROR/RETRY. The main memory interface asserts MSBE/RETRY 
RETRY when it detects a corrected error in main memory during a read operation. (Typically, in sys- 





tems utilizing error correction, only single-bit errors are corrected.) MSBE/RETRY is tied to 
both CAMMUs, and is sampled by the CAMMUs when RDYi is sampled. A master CAMMU 
issues a trap to the CPU when it detects MSBE/RETRY asserted. The main memory inter- 
face must not assert an interrupt when it detects a corrected data error. The MSBE/RETRY 
signal must be presented to the CAMMU by the memory interface along with (during the 
same BCLK as) RDYi and the data to indicate a corrected error. MSBE/RETRY may not be 
asserted when RDYi is inactive during main memory accesses. 





The MSBE/RETRY signal is also used to abort and retry CLIPPER Bus operations. If the sig- 
nal is asserted during access of I/O space (TG=4) while RDYi is inactive, the current bus 
operation is aborted by the master CAMMU with no trap assertion to the CPU. 


Thus, if this pin is active during the same BCLK that RDYi is HIGH, an MSBE condition is 
recognized; if this pin is active during a BCLK when RDYi is LOW (for an I/O space access), 
a RETRY condition is recognized. 
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Table 12 CLIPPER Bus Signal Descriptions (cont.) 


Signal 





MMBE 


> 
a 
x 


2 


NMIACK 


RATE 


Type 
I 


1°) 


Description 


MEMORY MULTIPLE BIT ERROR. The main memory interface asserts MMBE when it 
detects an uncorrectable error in main memory during a read operation. (Typically, these will 
be multiple-bit errors, because in systems using error correction, only single-bit errors are 
corrected.) This signal must be asserted during the same BCLK cycle that RDY is asserted. 
MMEBE is tied to both CAMMUs, and is sampled by the CAMMUs when RDYi is sampled. A 
master CAMMU issues a trap to the CPU when it detects MMBE asserted. The main 
memory interface must not assert an interrupt when it detects an uncorrectable 

data error. 


BUS ERROR. BERR should be asserted by user-designed logic to indicate a CLIPPER Bus 
error condition such as a bus timeout. BERR is tied to both CAMMUs. Upon assertion of 
BERR, the master CAMMU terminates the bus operation and indicates to the CPU that the 
operation is completed. (If the bus error occurs during a read operation, the data asserted 
on the AD bus at the time BERR is asserted is transferred by the CAMMU to the CPU). The 
CAMMU does not issue a trap in response to a bus error; the bus error logic should assert 
an interrupt. 





INTERRUPT VECTORS. This is an inverted logic (LOW=logic 1) bus, tied directly to the 
CPU. It transfers interrupt vector numbers associated with interrupt requests. 


INTERRUPT REQUEST. This signal, tied directly to the CPU, is asserted by system devices 
for interrupt service requests. Once asserted, IRQ must remain asserted until IACK is as- 
serted by the CPU. An interrupt level and number must be asserted on IVEC<7:0> with 
each interrupt request. IRQ is maskable. 


INTERRUPT ACKNOWLEDGE. IACK is asserted by the CPU in response to an interrupt re- 
quest (IRQ) to acknowledge that servicing of the interrupt is in progress. 


NON-MASKABLE INTERRUPT. This signal, tied directly to the CPU, is asserted by system 
devices for non-maskable interrupt service requests. Once asserted, NMI must remain as- 
serted until NMIACK is asserted by the CPU. 


NON-MASKABLE INTERRUPT ACKNOWLEDGE. This signal is asserted by the CPU in 
response to an NMI request to acknowledge that servicing of the interrupt is in progress. 


BCLK RATE CONTROL. This input to the CLIPPER Module controls the CLIPPER Bus 
BCLK frequency. When RATE is tied to GND, BCLK frequency is 1/2 MCLK frequency. 
When RATE is tied to Voo, BCLK frequency is 1/4 MCLK frequency. For example if the 
standard 100 MHz crystal is used in the CLIPPER Module, BCLK frequency is 25 MHz if 
RATE is tied to GND, and 12.5 MHz if RATE is tied to Vpp. 
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Table 12 CLIPPER Bus Signal Descriptions (cont.) 


Signal Type Description 


BCLK oO BUS CLOCK. BCLK clocks all devices on the CLIPPER Bus. All signals must be clocked 
onto the CLIPPER Bus on the rising edge of BCLK; all signals must be latched/sampled 
from the CLIPPER Bus on the rising edge of BCLK. The propagation delay of signals 
asserted on the system bus must be less than one BCLK period and more than the BCLK 
skew between devices in order to ensure proper operation of the synchronous 
CLIPPER Bus. 


2BCLK Oo TWICE BUS CLOCK. This signal is twice the frequency of BCLK and in phase with it. 
2BCLK facilitates CLIPPER system design by providing additional clock edges for the 
synchronization of external logic and devices. 





RESET I RESET. This is the master reset signal which is asserted by system logic to reset the 
CLIPPER Module and other devices on the CLIPPER Bus. Upon release of RESET, the 
CPU begins instruction fetching at Boot space address 0. 


URE oO UNRECOVERABLE FAULT. This signal is asserted by the CPU to indicate that it has 
stopped program execution as a result of an unrecoverable fault condition. An un- 
recoverable fault occurs when the CPU encounters an error condition during execution of on- 
chip diagnostic routines, or when a trap occurs during the execution of INTRAP or rett. 


URDIAG I APPLY DIAGNOSTICS. This input to the CPU is asserted to force the CPU to execute on- 
chip diagnostic routines resulting in the following: 


(1) The diagnostics detected no error conditions. The CPU begins program execution at 
Boot space address 0 (supervisor virtual address 6000 hex). 
(2) The diagnostics detected an error condition. The CPU asserts URF and stops execution. 


RESET must be asserted when URDIAG is asserted. 


OSC | OSCILLATOR INPUT. This signal is used by the CLIPPER Clock Control Unit to derive all 
clocks on the CLIPPER Module. The active edge of OSC is the HIGH to LOW transition. 
a 
SYNC | SYNCHRONIZATION. This signal can be used to synchronize BCLK with devices on the 
CLIPPER Bus. When this input is not used, it can be tied HIGH, tied LOW, or left uncon- 
nected. The SYNC input is sampled on the HIGH to LOW transition of OSC. 
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Table 13 Signal Summary 








Signal | ___Mnemenic | InputOutput’ | __—Aotive State 
Address/Data Bus Pp DE HIGH 
Big Endian Pp BIG LOW 
LOW = output 
Memory Space System Tag ae a ee (eee HIGH 
Cycle Type HIGH 
Cache Bus | cpsycasy¢ | Oo HIGH 
Bus Lock <<] Se © LOW 
Transfer Request a ee ee - LOW 
Ready a | HIGH 
| RDYo,ADYoL =| OF HIGH 
Bus Request | BR,BRG =| HIGH 
Bus Grant | BG, BGs | HIGH 
Memory Single Bit Error/Ret MSBE/RETRY | 1 | LOW 
Memory Multiple Bit Error | MMBeE [| 14 | LOW 
Bus Error | BERR | LOW 
Interrupt Vector Bus VEC a ae LOW 
Interrupt Request << ee LOW 
Interrupt Acknowledge PACK | LOW 
Non-Maskable Interrupt PNM LOW 
Non-Maskable Interrupt Acknowledge | NMACK | oO. | LOW 
BCLK Rate Select” RATE ed HIGH = 80 ns 
LOW = 40 ns 
Bus Clock po BCLK | = 
Twice Bus Clock | aBLCK | = 
Master Reset | RESET, | LOW 
Unrecoverable Fault | URF | oOo | LOW 
Apply Diagnostics | URDIAG | LOW 
Oscillator Input ee ee = 
Synchronization | sync | = 


Notes: 


1. Inputs are designed with a nominal switching threshold of 1.3 V and are therefore referred to as TTL compatible. All 
outputs (excluding BCLK, 2BCLK and URF) are open drain structures with pull-up resistors (220 Ohms) to Vcc on the 
module. BCLK, 2BCLK and URF are standard CMOS output signals. If an external pull-up is used for URF, it must be 
at least 220 Ohms. Timing parameters are referenced to standard TTL levels. 


2. The BCLK period values shown are for an OSC frequency of 100 MHz. 
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signals, and Table 13 contains a summary of the bus 
signals. 


9.1. System Clock 

The C300 is clocked by an extemal oscillator signal, 
OSC. A Clock Control Unit derives from OSC both the 
internal clocks used on the Module and external clocks 
which can be used for system clocking. 


MCLK (Module Clock), one of the internal clocks, is the 
internal CLIPPER master clock, used to drive the CPU, 
the CAMMUs, and associated module logic. The fre- 
quency of MCLK is one half the frequency of OSC. 
Therefore, if the typical 100 OSC frequency is used, the 
MCLK frequency is 50 MHz. MCLK is not availiable for 
off-Module use. 


BCLK (Bus Clock) is the C300 system clock, used to 
clock devices interfaced to the CLIPPER Bus. The CLIP- 
PER Bus is synchronous: all data and control signals 
are asserted and sampled on the rising edge of BCLK. 
BCLK frequency is either one half or one fourth the fre- 
quency of MCLK, depending on the state of the C300 
Rate control pin. If RATE is tied to GND, BCLK frequen- 
cy is one half the frequency of MCLK; if RATE is tied to 
VGC, BCLK frequency is one fourth the MCLK frequen- 
cy. Therefore, assuming an OSC frequency of 100 MHz, 
BCLK frequency is either 25 MHz (40ns) or 12.5 MHz 
(80ns). BCLK is in phase with MCLK, with the LOW to 
HIGH transitions of the clocks occurring in phase with a 
skew of less than + 5ns. 


2BCLK (Twice Bus Clock) is an output signal having 
twice the frequency of BCLK. It provides additional 
clock edges for devices interfaced to the CLIPPER Bus. 
2BCLK is in phase with BCLK, with a skew of less than 
+ ins. 


9.2. System Configuration 

Any device (or functional unit) which meets the 
CLIPPER Bus protocol and electrical requirements 
(timing, threshold, and loading) can be interfaced to the 
CLIPPER Bus. Such devices include memory, I/O 
devices, and subsystems as well as the CLIPPER 
C300. A typical CLIPPER system configuration is shown 
in Figure 39. 


Up to 4 G-bytes of memory can be addressed by the 
CLIPPER C300 via its 32-bit address bus. This memory 
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can be interfaced directly to the CLIPPER Bus if re- 
quired. 


VO devices such as disk controllers, bus translators, 
data communications devices, and associated control 
logic such as bus arbitration units and interrupt control- 
lers, can also be interfaced to the bus. Such devices 
may be “off-the-shelf or user-designed. I/O devices are 
typically located in I/O space, but can also be located in 
main memory space. 


Figure 39 CLIPPER System 
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9.3. Definitions 9.4. Bus Protocol 
Several terms are used in the following text which may CLIPPER Bus operations are governed by the following 
not have universal meanings. These terms and their rules: 


defintions as used in this text are as follows: 
(1) A bus master cannot introduce wait states. This 


Master: requires that a bus master be able to transfer data at 

A device which has control of the CLIPPER Bus. A the maximum rate allowed by the bus protocol. 

master gains contro! of the bus by asserting BR (Bus (2) Slaves may introduce wait states by delaying the 

Request), then receiving BG (Bus Grant) from bus assertion of RDY (Ready) on the CLIPPER Bus. 

arbitration logic. Wait states can be introduced between the address 
and data cycles of an operation by delaying RDY 

Slave: and between data words in a quadword transfer by 

A device that is being addressed via the CLIPPER toggling ROY. 

Bus. A slave is addressed by a master. (3) All CLIPPER Bus signals must be sampled on the 
positive edge of BCLK. 

Memory interface: (4) All signals must meet required set-up and hold 

Logic which controls data transfer to and from main times with respect to the positive transition edge of 

memory. BCLK.Signals must not transition within the Tsu set- 
up time of BCLK rising edge or undefined states 

VO Write: within the C300 can result. 

Write by an I/O device. (5) If RDYo (Ready out) is asserted on the CLIPPER 
Bus by a CAMMU while CBSY is active, the memory 

VO Read: interface must abort its data transfer. 


Read by an I/O device. 


Figure 40 Cache Line Replacement 
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9.4.1. Bus Arbitration 

A bus arbitration unit which arbitrates control of the 
CLIPPER Bus must be implemented in systems utilizing 
the CLIPPER C300. The unit must be capable of receiv- 
ing bus requests from each of the possible bus masters 
via Bus Request lines (BRx, where "x" identifies a par- 
ticular bus master), and must be able to assert a Bus 
Grant (BGx) for each bus master. The unit may support 
priority assignment such that in cases of multiple re- 
quests for the bus, the bus arbitration unit grants the 
bus to the highest-priority requesting device. 


Abus master should hold BRx asserted during its entire 
access of the CLIPPER Bus, then should release BRx 
as soon as possible after completion of its data transfer 
in order to maintain high system throughput. The bus ar- 
bitration unit should hold BRx asserted until the bus 
master has released BRx. 


Multiple Bus Operations 

A bus master _can execute multiple bus operations by 
holding its BRx signal! asserted until it has completed all 
its data transfers. Read-modify-write operations, for ex- 
ample, require that the bus masters executing the opera- 
tions maintain contro! of the bus during the reads and 
following writes, and the bus master maintains this con- 
trol by holding BRx asserted until after completion of the 
write. Another example of a multiple-bus operation is 
the replacement of a cache line as a result of a cache 
miss. As shown in Figure 40, the operation consists of a 
quadword write of the cache line to memory if the line is 
dirty, followed by a quadword read of the replacement 
line into the cache. 


9.4.2. Bus Control 

The bus control signals indicate CLIPPER Bus opera- 
tion status which is used to implement bus protocol and 
control, support Bus Watch, and give a bus master the 
means to secure the bus indefinitely in order to com- 
plete multiple bus operations. 


A Ready Input (RDYi) tied to each CAMMU is used to 
synchronize data transfers between CLIPPER, I/O, and 
memory. When a CAMMU reads data, it holds the bus 
in a read state until the responding device asserts 
RDYi, indicating that the data to be read is on the bus. 
When a CAMMU writes data, it provides data on the 
clock following the address phase of the operation until 
the device being written to asserts RDYi, indicating that 
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it has latched the data. RDYi is thus used to accom- 
modate various response times of devices on the bus. 
This eliminates the need to introduce for all data trans- 
fers the number of wait states necessary to accom- 
modate the slowest device on the bus. 


Ready Out (RDYo) is asserted by the D-CAMMU during 
Bus Watch operation in response to an I/O read of 
memory data that is cached. RDYo is active only during 
this operation. If the memory page being read is tagged 
as a copy-back page, then changes to the page data in 
the cache are not copied to the page in main memory 
until the page is replaced by the operating system. If an 
I/O device reads data from memory that is cached, and 
if the cache has updated data that has not been copied 
to the memory location being read, the D-CAMMU as- 
serts RDYo while asserting CBSYd. This aborts asser- 
tion of memory data. (The memory interface must be 
designed to abort the memory operation when both 
CBSYd and RDYo are asserted.) The D-CAMMU in- 
stead asserts the updated cache data on the CLIPPER 
Bus, which is read by the I/O device, and RDYo. In this 
way, transfer of valid data to I/O devices is assured. 


Ready Out I-CAMMU (RDYoi) is asserted by the 
I-CAMMU to indicate assertion or latching of data in 
response to access by the D-CAMMU. Since only the 
D-CAMMU can access the I-CAMMU, this signal is tied 
only to the D-CAMMU. 


Two Cache Busy signals, one for the |-CAMMU (CBSYi) 
and one for the D-CAMMU (CBSYdq), are used to indi- 
cate CAMMU internal operations associated with Bus 
Watch. CBSYi and CBSYd may be ORed to form a 
single Cache Busy (CBSY) signal on the CLIPPER Bus 
as shown in Figure 37. When a CAMMU Bus Watch 
mode is invoked during a memory access, the affected 
CAMMU asserts Cache Busy to indicate that it is check- 
ing whether the accessed data location is cached (see 
Figures 55 and 56). 





If the bus operation is a write, the memory interface 
must not assert RDYi until CBSY is released by the 
CAMMU. This ensures that the CAMMU has time to up- 
date data in its cache before the bus operation is com- 
pleted. If the operation is a read, the memory interface 
must not drive the drive the Address/Data bus until 
CBSY is released. This allows the CAMMU to abort 
assertion of data by the memory interface, and to 
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provide cached data if required (see Ready Out 
description). 


Transfer Request (TR) is asserted by bus masters to in- 
dicate that CLIPPER Bus operations are in progress. 
While Transfer Request is asserted, no bus master 
other than the one controlling the bus can gain bus ac- 
cess. 


Lock (LOCK) is used in dual-bus applications in which 
the CLIPPER Bus is interfaced to a separate I/O bus 
through a bus adapter or a dual port memory, this 
provides CLIPPER Bus masters with a means of main- 
taining control of the I/O bus or dual port memory 
throughout successive bus operations. LOCK becomes 
active during DTU page table access, cache line re- 
placement, and read-modify-write operations. 





Figure 41 Single Word Read (1 Walt State) 
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Direction Control (DIR) is used to control drive direction 
of TTL transceivers buffering the C300 Address/Data 
bus. This signal controls the transceivers with proper 
timing for all bus operations, eliminating the need for 
such logic in the system. 


9.4.3. Memory Errors 

Memory data errors are reported with the Memory 
Single Bit Error/Retry (MSBE/RETRY) and the Memory 
Multiple Bit Error (MMBE) signals. These signals are as- 
serted by error detection and correction logic within the 
memory interface to indicate that a single-bit error has 
been detected and corrected (MSBE/RETRY), or that 
an uncorrectable multiple-bit error has occurred 
(MMBE). The signals, tied directly to the CAMMUs for 
fast response, force traps to error-handling routines. 
Timing for MSBE/RETRY and MMBE is shown in Figure 
41, Note that the signals are asserted during the same 
cycle that RDYi is active. 
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NOTES: 
1. Timing for SINGLE WORD READ with NO WAIT STATES 
is shown in Figure 53. 
2. These signals are asserted here by the memory interface 
to indicate memory data errors. 
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In cases of multiple-bit and single-bit errors, the 
CAMMU Fault Register does not capture the addresses 
causing the errors. It is therefore necessary to design 
an address snapshot register into the system to capture 


‘addresses for use by the trap routine servicing single-bit 


errors if analysis of the errors is required. 


The MSBE/RETRY signal is also used to abort and 
retry CLIPPER Bus operations. If the signal is asserted 
during access of /O space (TG = 4) while RDYi is inac- 
tive, the current bus operation is aborted and retried 
with no trap assertion. This feature is intended to 
resolve Bus Lockout in dual-bus systems, which occurs 
when a CLIPPER Bus master and an I/O processor 
(IOP) bus master simultaneously request access to 
each other's buses. For example, if CLIPPER has con- 
trol of the CLIPPER Bus for attempted access of the I/O 
bus at the same time that an /O bus master has control 


Figure 42 Bus Retry (Single Word Read Example) 
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of the I/O bus for attempted access of the CLIPPER 
Bus, each bus master waits for the other to release its 
bus. Each bus master is therefore “locked out" from the 
other bus until one of the masters is forced to release 
its bus. Simple logic in the interbus interface logic can 
be used to assert MSBE/RETRY whenever a CLIPPER 
request for the I/O bus occurs simultaneously with an 
IOP request for the CLIPPER Bus. This forces the 
CLIPPER to abort its bus operation and release the 
CLIPPER Bus, then re-arbitrate access to the CLIPPER 
Bus for a retry of the aborted operation. The IOP can 
gain access to the CLIPPER Bus after the abort by the 
C300 but before the retry, thus eliminating the Bus Lock- 
out condition. The CLIPPER then waits for completion 
of the I/O operation before gaining access to the I/O 
bus for the retry. Timing for bus retry is shown in Figure 
42. 


ABORTED ee os «RETRY — 
Tarb | Tadd1 | Tadd2 | | rare] 


BCLK 


2BLCK 





= Ec wae 0 ee oe 


AD 











RDYI 


MSBE/RETRY 
NOTE: 


eas 














\ f Alas 


Retry occurs when MSBE/RETRY is asserted while RDYi is released 


during access of I/O space only (TG = 4). 
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In summary, the Memory Single Bit Error/Retry signal 
operates as follows: 


1. If the signal is asserted during any time other than ac- 
cess to I/O space and during the same clock cycle that 
RDYi is active, the signal reports a corrected memory 
single-bit error. This causes the CAMMU to generate a 
trap to the CPU. 


2. If the signal is asserted during access of I/O space 
(TG = 4) while RDYi is inactive, the current bus opera- 
tion is aborted and retried by the master CAMMU with 
no trap assertion to the CPU. 


9.4.4. Bus Error 

A bus operation can be aborted by the assertion of 
BERR (Bus Error) by user-designed logic implemented 
in the CLIPPER system (see Figure 43). Bus error con- 
ditions should be detected by the bus error logic, which 
should then assert BERR and an interrupt request (via 
the interrupt logic). The CAMMU terminates the system 
bus access and releases the bus when it detects the 





Figure 43 Bus Error 
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assertion of BERR. The CPU should use the interrupt re- 
quest to vector to a routine designed to resolve the bus 
error condition. 


BERR must be asserted by the bus error logic for one 
BCLK cycle. The state of the CLIPPER Bus associated 
with the bus error may be stored by the bus error logic 
for use by the bus error interrupt service routine. 


9.4.5. Unrecoverable Fault 

Some errors allow no clean means of recovery for con- 
tinuation of program execution. These errors include the 
occurrence of a trap during execution of INTRAP or 
reti, and the detection of a fault during self-test (see 
Section 9.4.9). A trap during execution of INTRAP or 
reti can be avoided by ensuring that the Exception Vec- 
tor Table is set up prior to the occurrence of a trap con- 
dition, and that the supervisor stack pointer always 
points to a valid page. No other conditions generate an 
unrecoverable fault. 


Were the CPU to ignore these error conditions and con- 
tinue execution, effects on the system could be 
catastrophic. A faulty or "lost" CPU could execute ran- 
dom writes to memory and I/O, for instance, corrupting 
data in both main memory and secondary storage. The 
CLIPPER CPU offers protection from catastrophic 
failure by stopping program execution immediately upon 
detection of one of the unrecoverable fault conditions, 
before the system is corrupted. It then asserts the Un- 
recoverable Fault signal (URF) as a hardware indication 
that the CPU is halted due to an unrecoverable error, 
and that human intervention is required to correct the 
problem. 


9.4.6. Walt States 

Slow devices can introduce wait states by delaying 
assertion of RDY on the CLIPPER Bus during bus 
operations. Wait states consist of an integral number of 
BCLK periods during which time the master device 
remains in a “waiting” state until the slave device as- 
serts RDY to indicate that it has asserted data on the 
bus (if a read operation by the master), or has read 
data from the bus (if a write). Wait states are further ex- 
plained in the following descriptions of bus operations. 
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9.4.7. CLIPPER Bus Operations 

Unless otherwise noted, the signal nomenclature used 
in this section describe the signals as shown in 
Figure 37. 


— RDY is tied to RDYo to form a single ready signal 
(RDY). RDY is gated with RDYoi on the CLIPPER 
Module Interface. 

— CBSYd and CBSYi on the CLIPPER Module 
Interface are gated to form a single ORed CBSY 
signal on the CLIPPER Bus. 


ACLIPPER Bus operation begins when a bus master re- 
quests the bus by asserting its Bus Request signal and 
receives Bus Grant from the bus arbitration unit. The 
bus master can then execute one of four bus opera- 
tions: a read operation, a write operation, a global write 
operation, or a multiple memory access operation. 


Figure 44 Quadword Read (No Walt States) 
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Read Operation Ores 

Upon receiving Bus Grant (BGx), the master (possibly a 
CAMMU) asserts TR, Memory Space System Tag 
(TG<2:0>), Cycle Type (CT<5:0>), and a real address 
(AD<31:0>) on the bus. The Memory Space System 
Tag, Cycle Type, and TR signals remain asserted during 
the entire operation. However, the bus master three- 
states the multiplexed address/data lines (AD<31:0>) 
after two BLCK cycles to make the lines available for 
data transfer by the slave device (see Figures 41 and 


44), 


The bus master then waits for the slave device to assert 
RDY (CAMMU RDYi signal), indicating that the data is 
on the bus. The master latches the data on the same 
positive transition of BCLK that it detects assertion of 
RDY. The slave device can respond with data im- 
mediately after the address/data lines are three-stated 
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by the bus master, or can respond later as required by 
delaying assertion of RDY thereby introducing wait 
states. 


The minimum number of BCLK cycles required for a 
read operation is three, exctuding bus arbitration require- 
ments: two cycles are required for assertion of address, 
and at least one cycle is required for the data transfer. 


Bus Watch During Read Operations 

During I/O reads of private, copy-back main memory 
space (i.e., TG=2), each CAMMU with Watch I/O Reads 
enabled asserts CBSY, indicating to other bus devices 
that it is checking for dirty cached data (cached data not 
yet written to main memory) corresponding to the main 
memory location being accessed by the I/O device. If it 
finds dirty data, it asserts the data on the AD bus, as- 
serts RDYo, and releases CBSY. The '/O master must 
then latch the data on the positive BCLK transition fol- 
lowing assertion of RDYo. !f the data is not cached or 





Figure 45 Memory Interface CBSY Monitoring 
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the cached data is not dirty, the affected CAMMU 
releases CBSY, allowing transfer of main memory data 
to the bus master. This Bus Watch sequence applies to 
both single-word reads and quadword reads. If Bus 
Watch intervention occurs during quadword reads, 
however, the affected CAMMU will retum all four data 
words if one or more data words is dirty. 


The main memory interface must monitor the CLIPPER 
Bus CBSY and RDYo (which can be tied to the RDY sig- 
nal) lines and allow main memory data response on 
AD<31:0> only if CBSY (CBSYi or CBSYd on the 
CLIPPER Module Interface) is not asserted, indicating 
that there will be no CAMMU intervention resulting from 
CAMMU Bus Watch. If RDYo is asserted by the 
CAMMU while CBSY is asserted, the memory interface 
must abort the memory read because the CAMMU is 
responding with more recent cache data. Memory 
monitoring of CBSY is summarized in Figure 45. 
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(1) Single Word Writes: CAMMU updates cache on hit; memory interface must not assert RDY until after CBSY 


is released. 


(2) /O Reads: CAMMU provides data on cache hit; memory interface must not assert RDY until after CBSY is 
released, and may enter into memory data that is supplied by the CAMMU (indicated by assertion of RDY and 
CBSY by the CAMMU) in order to support Clear Dirty operation if required. 


(3) Quadword Writes: The memory interface proceeds normally (doesn't monitor CBSY) if the bus arbiter inhibits 
granting of the bus again while CBSY is asserted; otherwise the memory interface must not assert RDY until 





after CBSY is released. 
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The Clear Valid option, if enabled, requires memory to 
update its contents with the more current (dirty) data 
supplied by the cache for an I/O quadword read, unless 
the data will not be read by another I/O device (see Sec- 
tion 7.6.4). Use of this option saves a write of the dirty 
data to memory when the cache line is replaced. The 
Clear Valid option is enabled by setting the Clear Valid 
flag in the Control Register. Memory support for this op- 
tion requires that the memory transition from a memory 
read operation to a memory write operation when both 
CBSY and RDY are asserted by the D-CAMMU, and 
that the memory not be allowed any wait states be- 
tween the individual quadwords supplied by the 
CAMMU. 


CAMMUs normally require 4 MCLK (8Ons @ 100 MHz 


OSC frequency) cycles for Bus Watch checking. During 
this time the memory interface can proceed with the 


Figure 46 Single Word Write (1 Walt State) 
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read operation without delay up to, but not including, 
assertion of RDY. As a result of this parallelism, 
CAMMU Bus Watch operation results in little impact on 
CLIPPER Bus utilization. 


Write Operation 

Signal assertion and timing associated with a write 
operation are similar to those associated with a read 
operation. Upon receiving Bus Grant (BGx), the master 
(possibly a CAMMU) asserts TR, Memory Space Sys- 
tem Tag (TG<2:0>), Cycle Type (CT<5:0>), and a real 
address (AD<31:0>) on the CLIPPER Bus. The Memory 
Space System Tag, Cycle Type, and TR signals remain 
asserted during the entire operation (see Figures 46-48). 


After two BCLK cycles, however, the bus master 
replaces the address on the AD lines with the data to 
be written, and holds the data on the lines until the 
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slave acknowledges latching of the data by asserting 
RDY (CAMMU RDYi IDYi signal) on the CLIPPER Bus. The 
slave can assert RDY when ready, allowing wait states 
as required. 


If the operation is a single-word write, the bus master 
releases the bus immediately following detection of as- 
serted RDY (CAMMU RDYi signal). If the operation is a 
quadword write, the bus master asserts the second, 
third, and fourth data words of the quadword data trans- 
fer during successive BCLK cycles following detection 
of asserted RDY. The slave device can introduce wait 
states between assertion of the quadword address by 
the master and fatching of the first data word, and be- 
tween latching of the individual data words. 


The minimum number of BCLK cycles required for a 
write operation is three, excluding bus arbitration require- 


Figure 47 Quadword Write (No Walt States) 
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ments. Two cycles are required for assertion of the ad- 
dress, and at least one cycle is required for the data 
transfer. 


Bus Watch During Write Operations 

A CAMMU invokes Watch I/O Writes, if enabled, when 
an W/O device writes to its cacheable main memory 
space; and invokes Watch CPU Writes, if enabled, 
when a CPU (via a D-CAMMU) writes to its shared 
cacheable main memory space. Both Bus Watch 
modes, when invoked, function identically. if the write 
operation invoking one of these modes is a single-word 
write operation, the affected CAMMU updates the cache 
with the data written to the main memory if the main 
memory data has been cached. If the write operation is 
a quadword write, the affected CAMMU invalidates the 
cache line containing the data addressed in main 
memory. 
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A CAMMU normally requires 4 MCLK (8O0ns @ 100 
MHz OSC frequency) cycles to complete one of these 
Bus Watch operations. However, if the Bus Watch opera- 
tion occurs while the CPU is accessing the CAMMU, 

the CAMMU may require more time to complete the 
operation and will keep asserting CBSY to inhibit further 
bus operations until it has completed the task. The bus 
master, however, can complete the write operation while 
the CAMMU executes its Bus Watch operation, so Bus 
Watch impact on CLIPPER Bus utilization is 

minimal. 


lf CBSY is asserted during a byte, halfword, or word 
memory write operation, the memory interface must not 
assert ADY until after CBSY is released. This ensures 
that the data remains on the bus long enough for entry 
by a CAMMU into its cache in case of a hit. 


if CBSY is asserted during a quadword write, the 
memory interface can_ assert RDY normally without 
regard to the state of CBSY because in case of a cache 
hit, the affected CAMMU invalidates the hit line and 
does not require data to be present on the bus. 
However, the system bus arbiter must not grant the bus 
to a new bus master until CBSY is released, indicating 





Figure 48 Quadword Write (4 Walt States) 


that all CAMMUs are ready for a new operation. Alterna- 
tively, the memory interface can delay assertion of ROY 
until CBSY is released as in the byte/halfword/word 
write case, eliminating the need for the bus arbiter to 
monitor CBSY. In any case, a new CLIPPER Bus opera- 
tion should not be allowed to begin while CBSY is 
asserted. Memory monitoring of CBSY is summarized in 
Figure 45. 


Global Write Operation 

Aglobal write is used in a system utilizing multiple 
CLIPPER Modules to reset the TLBs or caches, or to 
write to specific TLB lines or registers in all CAMMUs in 
the system except the companion D-CAMMU of the 
CLIPPER CPU executing the global write. CLIPPER 
Module global addressing is explained in Section 7.6.6, 
CAMMU Register Access. Non-CLIPPER bus masters 
can execute global writes by setting CT<3:2> HIGH 
during the otherwise normal write operations. Note, 
however, that CAMMUs respond only to global writes to 
CAMMU I/O space real addresses Cnn and Dnn (Hex). 


Each CAMMU being written to by a global write asserts 
CBSYi (if an IL-CAMMU) or CBSYd (if a D-CAMMU) 
during the write to inhibit further bus activity until it has 
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completed internal tasks associated with the write. Sys- 
tem logic is required which detects global writes and as- 
serts RDY when CBSY is released. 


Read-Modify-Write Operation 
A read-modify-write bus operation is a combination of a 
read operation, followed by a write operation. Timing 
and protocol associated with the read and the write 
phases of a read-modify-write operation are the same 


as for single reads and writes; however, Bus Request 


(BR) must be asserted by the bus master during the en- 


tire operation. 


Read-modify-write operations are performed during ex- 
ecution of the tats (test-and-set instruction). However, 

the write part of the read-modify-write is performed only 
if the bit to be tested is zero; if the bit has already been 
set (AD<31> = 1), the bus operation is terminated. 


Timing for this operation is shown in Figure 49. 


Read-modify-write operations are also performed during 
address translation when the DTU accesses main 


Figure 49 Read-Modify-Write (Test and Set) 
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memory twice to read the Page Table Directory and 
Page Table in main memory, and follows with a write to 
the Page Table if the Referenced or Dirty flags must be 
modified. Timing for this operation is shown in 

Figure 50. 


9.4.8. Interrupt Bus 

The CLIPPER Bus includes a separate interrupt bus, 
IVEC<7:0>, tied directly to the CPU. This bus allows in- 
terrupt levels and numbers to be transferred to the CPU 
without regard to CLIPPER Bus activity, thereby reduc- 
ing CPU interrupt response time and increasing effec- 
tive CLIPPER Bus bandwidth. (See Section 6.3, 
Interrupts.) 


An interrupt controller must be implemented in a 
CLIPPER system. In cases of multiple interrupt re- 
quests, it must select between the interrupts, asserting 
the interrupt with highest priority. The interrupt controller 
must assert an interrupt request and its associated inter- 
rupt vector number together on the same positive transi- 
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tion edge of BCLK. The interrupt vector number can 
change to a higher priority on any BCLK. The CPU 
uses the interrupt level and number present on the 
IVEC bus when it detects IRQ release on a rising edge 
of BCLK, then releases IACK during the following BCLK 
period. 





9.4.9. Diagnostics Control 

The CLIPPER C300 executes diagnostic routines follow- 
ing release of RESET if Apply Diagnostics 

(URDIAG) is asserted during the two BCLK cycles fol- 
lowing the release of RESET. Then it begins execution 
at supervisor virtual address 6000H, which is mapped 
by the HTLB to real address 0 of Boot space. 





The state of URDIAG during the two BCLK cycles follow- 
ing release of RESET determines whether the C300 
CPU executes intemal diagnostics before executing 

from boot code (see Figure 35). This is a powerful fea- 
ture of the module which allows self test of major func- 
tions of the CPU without test equipment, and without 
removal of the chip. Failure during diagnostics is 
reported by assertion of the Unrecoverable Fault (URF) 
CLIPPER Bus signal. 


The C300 self test checks most, but not all, of the major 
functions of the CPU. It is intended to be a first-level 
check of the CPU, and in fact is used to initially test in- 
dividual CPU die during fabrication. The test executes 


Figure 50 Read-Modity-Write (DTU Operation) 


EAD PAGE TABLE DIRECTORY 





approximately 700 instructions in about 4500 MCLK 
periods, using operands which test the CPU under 
worst-case conditions where possible. For example, 
worst-case carries, overflows, and sign extensions are 
tested. 


The following CPU operations and functions are tested: 


— Pipeline resource management 

— Integer and Floating-Point Execution Units 

— General-purpose register files 

— Integer bypass mechanism 

— Transition between supervisor and user modes 

— Temporary (hidden) registers 

— Macro branches 

~~ All addressing mode computations 

-—— Arithmetic shift, logical shift, and rotate instructions 

— Integer multiply and divide 

— Single- and double-precision floating-point 
instructions 

— Floating-point status bits 


CPU operations which require external response to in- 
struction execution are generally not tested. These in- 
clude exception conditions, branches, loads, stores, 
pushes and pops, and |-CAMMU and D-CAMMU 
interfaces. 

Note: Diagnostics are designed to run in little endian 
mode (BIG tied HIGH) only. 
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9.4.10. Bus Timing 

Figures 51-66 show CLIPPER Bus signal timing and 
test loads. Values for parameters indicated in the 
figures are listed in Tables 15 and 17. 


BCLK is CMOS-compatible. All timing relationships in 
the timing figures are referenced to the 1.5 V midpoints 
of BCLK positive transitions. 


The following are definitions of terms used in the figures: 


Tarb (arbitration time) 
BCLK cycle used for bus arbitration 


Tadd1 (address time 1) 
First BCLK cycle during which address is asserted 
on the bus 


Tadd2 (address time 2) 
Second BCLK cycle during which address is as- 
serted on the bus 


Td (data time) 

BCLK non-wait state cycle during which data is as- 
serted on the bus. For a quadword transfer, a 
numerical subscript (e.g., Td2) indicates which data 
word is asserted. 


Tw (wait state time) 
BCLK cycle during which the CLIPPER Module is in 
a wait state. 


9.4.11. CLIPPER C300 Module Configurations 
There are three CLIPPER C300 Module configurations, 
shown in Figures 67 - 72. 


The C300 Module with standard connector is shown in 
Figure 67. \ts connector mates with a user-supplied type 
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Parameter 

Supply Voltage 

DC Input Voltage Per Pin 
Storage Temperature Range 






osweo | 
Dewvewsv fv 
2010 05 





BIC-Vero 905-72178F, or equivalent, male connector. 
Note that the numbering on the male connector is 
reversed relative to the C300 connector. 


The C300 Module with box connector (Figure 69) mates 
with a user-supplied Samtec SD-125-T-18, or a Mc- 
Kenzie SBU-2X25-STGT-D131-VL! female socket con- 
nector, or the equivalent. 


The C300 Module with dual connector (Figure 77) 
mates with a user-supplied McKenzie PH1-225/100 - 
32G male connector or the equivalent. 


9.4.12. Oscillator Connection 

An extemal oscillator must be provided by the user to 
drive the clock control chip on the CLIPPER Module. 
The oscillator frequency must be twice the required 
MCLK frequency, with a duty cycle between 60/40 and 
40/60. The oscillator should be placed as close to the 
connector as possible. 


9.4.13. System Synchronization 

The SYNC input signal can be used to synchronize 
BCLK with devices on the CLIPPER Bus. For example, 
in applications using multiple~-CLIPPER Modules, this 
input can be used to ensure that the BCLKS of all 
Modules are in phase. The OSC inputs of all the 
Modules must be connected to a single oscillator, and 
all the RATE pins must be in the same state. 


The SYNC input is sampled on the HIGH to LOW tran- 
sistion of OSC. To initiate synchronization of all Modules 
in the system, the SYNC input must be driven LOW for 
at least two OSC periods, then driven HIGH for the fol- 
lowing two OSC periods. Synchronization occurs in 
such a manner as to stretch all the clock inputs rather 
than shrink them. 


*Absolute maximum ratings are those values beyond which damage to the device may occur. Functional operation 


under these conditions is not implied. 
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Table 15 AC Characteristics Vcc = 5.0 V + 5%, Tc = 0 to +70°C 


ee ec 
[| SetupTime 


Setup Time ae ee ee 
Hold Time ze 2a ae 
Clock to Transition Time ie mee | eee! 










Notes: 

1. Transition, rise, and fall times are for a SOpF external capacitive load (see Figure 64 ). 
2. All outputs except BCLK and 2BCLK. 

3. To guarantee set-up times, the input signals must have rise and fall times < 4ns. 


Table 16 DC Characteristics Vcc = 5.0 V + 5%, Tc = 0 to +70°C 


Input HIGH Voltage 
Input LOW Voltage 
Voc = 4.75 V 


Output HIGH Voltage! 1oH = -20pA 


Voc = 4.75 V 
lou = -2mA 


Vec = §.25 V 
Voc = 5.25 V 


Input Leakage Current Vin = 0 to 5.25 V_ Inputs Only 


Input Capacitance 


Input HIGH Current Rp = 220 ohms | Bidirectional 1/O Only 
Input LOW Current Rp = 220 ohms _ | Bidirectional /O Only Vu = 0.55 V 


Bidirectional I/O 
Supply Current Ta = 0°C, Voc = 5.25 V 


Power Dissipation fosc = 100 MHz 
BCLK load = 100pF 





1. lon, lo and Ii parameters are a function of the value of Module pull-up resistor Rp. 
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Table 17 AC Characteristics - OSC, BCLK, 2BCLK Vcc = 5.0 V + 5%, Ci = 200pF, Tc = 0 to +70°C 


Characteristic Conditions [40 MHz c300_ | 50 MHz C300 | 
= Boe recs 

| tose _| 

[ tosc _| 







Oscillator Frequen ee 
Oscillator Cycle Time —— aaa 


toscL 8 






BCLK Cycle Time RATE = LOW 
RATE = HIGH | 100 | 
BCLK Pulse Width tc =Min(RATE=LOW) | 19 _| 
tc = Min (RATE = HIGH) 


tr BCLK Rise and Fall Time Cx = 200pF 
Ci = 100pF 
tg2B BCLK to 2BCLK Skew oe 


teBR 2BCLK Rise and Fall Time CL = SOpF 


2BCLK Cycle Time RATE = LOW 
RATE = HIGH 


2BCLK Pulse Width tesc = Min (RATE = LOW) 
tepc = Min (RATE = HIGH) 
Note 


BCLK rise and fall times are for a 100pF capacitive load (see Figure 65 ). This load should not be exceeded to 
ensure proper operation. 


tc 


tcH 





af 


ogi 
nm — 





9 


Table 18 DC Characteristics - BCLK, 2BCLK Vcc =5.0V + 5%, Tc = 0 to +70°C 


lon = +100mA Voc -0.45 


ViL Input LOW Voltage 





88 


This Material Copyrighted By Its Respective Manufacturer 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 


Figure 51 AC Measurement Points 


BCLK 1.5V 


ee oF Ts | 


INPUT 15v | 


Tco —| - 
OUTPUT 1.5V ‘(ce 
0.4V 0.4V 


we ee 


Figure 52a AC Measurement Points, OSC 


Lo eee 


osc 
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Figure 52b AC Measurement, BCLK and 2BCLK 


—i. 


Figure 520 AC Measurement, OSC and SYNC 


6s ae a ae 
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Figure 53 Read Timing Diagram 


Tadd2 Td 





— |— Teo Tco —+| -— 
bg | | Teo Tco —4 | 

" XK nervecvoe | 
SS a kK TH 

RDYi ee A ce 


A157 
NOTES: 
1. Timing measurements are referenced as specified in Fig. 51. 


2. I-CAMMU internal registers can be accessed through the companion D-CAMMU 
(D-CAMMU of same CLIPPER Module). 
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Figure 54 Write Timing Diagram 


| Tarb | Tadd1 | Tadd2 | Td | 


° 


— = 
_ 2 
— TH 


=i 
"7. 7 
ae t| ‘ Lp 


~ 


TG, CT MEMORY SPACE SYSTEM TAG, CYCLE TYPE 
r= Tco ‘ | 
TR * # 
ACTIVE CYCLE 
Tsu —| — | TH 
RDYI ———-- - ee ees, ll. ae 


A158 


NOTE: 
Timing measurements are referenced as specified in Fig. 51. 


92 


This Material Copyrighted By Its Respective Manufacturer 


CLIPPER® C300 
32-Bit Compute Engine 


Advance Information 





Figure 55 Watch /O Reads 


| Tadd1 | Tadd2 | BUS WATCH | Td | 


Sir jie 


| | 
“mdf eee ee ee 


1™G,CT VO READ, COPY-BACK SPACE 
mF 
TR | ee ae ae 
Tco te = Tco 
CBSYd a CACHE MEMORY 
ag kK 
ke Teco 


RDYo (HIT*DIRTY) 








A159 


NOTES: 
1. VO READ: Device reads from main memory (e.g., main memory to disk). 
2. Timing measurements are referenced as specified in Fig. 51. 
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Figure 56 Watch CPU and I/O Writes 


| Tadd | Tadd2 | «BUS WATCH | Tw | Td | 


tf ae NP Nag a 
| 


tu —f kf} TH 


AD ADDRESS DATA / 


TG,CT vo =m WRITE-THROUGH OR COPY-BACK 
, ITE, SHARED WRITE-THROUGH 
Tsu — 


ET 


cBsYid = laa i ‘= is | 


A160 











NOTES: 


. VO WRITE : VO device writes into main ere .g., disk to main memory). 
. WRITE TO SHARED PAGE : One of the CAMMUs writes ad the shared area of main memory. 
8. Timing measurements are referenced as specified in Fig. 5 
ADYi is asserted by the memory interface. 
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Figure 57 D-CAMMU Read from Companion !-CAMMU 


| Tarb | Tadd1 | Tadd2 | CBSY | td | 


sx *F \ FF \F VF VF \ 
=} fetes | | teal be ; 


ee a7 = f =|osla 


(D-CAMMU) 






DIR 


ae 


=i tee 





Teo Teo —| be 
ADDRESS 
Teo | 





AD 





1G, CT 1 x MEMORY SPACE SYSTEM TAG, CYCLE TYPE 
Teo le - 


Teo fh “i —1 [t- Tee 


RDYoI et 
ee: ee. cease 





RDYI 
A167 


NOTES: 

1. Timing measurements are referenced as specified in Fig. 51. 

2. 1-CAMMU internal registers can be accessed through the companion D-CAMMU 
(D-CAMMU of same CLIPPER module). 
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Figure 58 D-CAMMU Write Into Companion -CAMMU 


| Tarb | Tadd1 | Tadd2 | CAMMU BUSY | Td | 


BCLK 


Tco | 
R oo ee 
= ey 
Nig Tco Teo ee) je | . | — Teo 
= | eee 
TG, CT MEMORY SPACE SYSTEM TAG, CYCLE EE 


|e—Teo 
—, a eae 


ACTIVE ath 


Teo = | =| _ [ete 
CBSYI 








RDYol | 
Tsu 
=a ke TH 
RDYI 
A162 
NOTES: 
1. ba measurements are referenced as specified in Fig. 51. 
2. I-CAMMU internal registers can be accessed through the companion D-CAMMU 


(D-CAMMU of same CLIPPER module). 
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Figure 59 Maskable Interrupt Request/Acknowledge Timing 


= ae a eo a a 


Tsu — | 


a eee f T 


rode 4 yale 


CPU LATCHES IVEC 


NOTES: 
1. After TACK returns high the TRO line must be high for one clock before another interrupt 


request retums IRQ low. 

2. The IVEC lines can change only to a higher priority when IRQ is low. The higher priority value 
must be on the IVET lines by “a”. 

3. CPU latches IVEC on the rising edge of BCLK following release of TRO. The CPU releases 


“TACK during the BCLK period following release of IRQ. 


4. Timing measurements are referenced as specified in Fig. 51. 
ates 


Figure 60 Non-Maskable interrupt Request/Acknowledge Timing 


aif | Ae 


sei ry, ke 
NMIACK | xy y 


NOTES: 
1. After NMIACK retums high the NMI line must be high for one clock before another 


non-maskabie interrupt request returns NMI low. 
2. Timing measurements are referenced as specified in Fig. 51. 
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Figure 61 LOCK Timing 


= = — hemes 


Figure 62 URF Timing 


Figure 63 RESET and URDIAG Timing 


mx fT NEN EN 
—+| as Teu —| = Tn 
RESET 


—| j-a—— Tou Tr | j-— 


URDIAG \ , alte 


Note: URDIAG should not be asserted if BIG is ted high. 
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Figure 64 Module Output Test Load 


1.8K 


IN914 





SS SSeS - A120 


Figure 65 BLCK Output Test Load 


1.8K 


1N914 





A121 
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Figure 66 Maximum Output Delay vs. Capacitive Loading 


16.5 
16.0 
15.5 
15.0 
Top (ns) 
14.5 
14.0 
13.5 


13.0 





NOTE: 
These values require termination of 330/220 ohm from Vcc to GND. 
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Figure 67 CLIPPER C300 Module C300C3MLX with Standard Connector 
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Figure 68 Pinout of CLIPPER C300 Module C300C3MLX Connector 











END VIEW OF MODULE 
cB A i 
Vee 1 Sl SATE GND 
CBSYi 2 = ane osc 
URDIAG 3 = Bai GND 
IVEC2 4 Ss Bay 2BCLK 
icelelS GND 
NEC? = ivect ulead 
IVEC7 6 | WEG NMi 
7 = bears BRI 
Pescanies = TvEce 
RESET 8 + Te RDYoi 
ADIs 9 | BIG 
-— NMIACK 
AD20 10 _ aa ECO 
AD22 11 = ror] Wees” 
1] 
AD24 12 = Aor ADI9 
GND 13 = GND 
+ AD23 
AD28 14 = ae AD25 
AD29 15 = AD27 
= Vee 
AD31 16 ss aos AD16 
MSBE/RETRY 17 i— ari GND 
AD13 18 = aie AD14 
AD10 19 = ap AD11 
= 
ADOS 20 = A ADO7 
Vee 21 = Vee 
J ADOs 
ADOS 22 = ADO3 
i— ADO4 
ADO2 23 LH 7 ADO1 
ADOo 24 = aT MMbE 
GND 25 = GND 
= TG2 
TG1 26 | BERR 
| TGO 
CT1 27 + BGd 
= GND 
CTo 28 | RDYi 
= CTs 
cT4 29 4 Vee 
= CT3 eae 
cT2 30 = 7 BS 
RDYo 31 +—| GND 
Maatidiess = SYNC 
LOCK 32 = aaa BLCK 


NOTE: Numbering on the CLIPPER Module female connector may not correspond 
to numbering on user-supplied male connectors. 


A140 
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Figure 69 CLIPPER C300 Module C300C3BLX with Box Connector 
SIDE VIEW OF MODULE 200 MAX 


BOX CONNECTOR =] 


_ 
McKENZIE PPC3/34- a 


S$H17-32/3H27-64- | | 
STGT 
(300 280 
TOP VIEW OF MODULE 
4.500 
3.100 
4.350 
-150 TYP 





D = .125 DIAMETER 


NOTE: 
Package dimensions are given in inches. A125 
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Figure 70 Pinout of CLIPPER C300 Module C300C3BLX Box Connector 





c B A 
Veo 1 GND 
BSY! 2 coll osc 
cl 
ROA 3 bata GND 
u Ga 
4 se 2BCLK 
iVEC2 
ivece 8 ney GND 
wae? . IVECI ror 
C7 gar 
lA 7 _ BRI 
CK VEC 
RESET 8 nce RDYol 
koi. 28 tat ae 
8 —=— 
AD20 10 oan IVECO 
apa 11 sil WECS 
AD17 
aD2 12 AD19 
AD21 
GND 13 QND 
AD23 
AD@ 14 AD25 
apd26 
AD29 415 AD27 
Veo 
AD31 16 AD16 
<a enmeeeiie AD30 
MSBE/RETRY 17 GND 
AD18s 
AD1I3 18 ADi4 
AD12 
AD1O0 «(19 ADI1 
GND 
ADoS 20 A007 
ADos 
Vee 21 Vee 
ADOSG 
ADOS 22 ADO3 
ADO4 
ADO2 23 ADO1 
Veo ‘ince 
ADOO 24 MMBE 
DIR 
GND 25 GND 
TG2 — 
Ta1 26 BERR 
TGO 
CT1 27 BGd 
GND 
CTo 28 RDYI 
CTs 
CT4 20 Vee 
cT3 
CT2 30 7h CBSYd 
RDYo 31 GND 
SYNC 
LOCK 32 BLCK 
BRd 


NOTE: Numbering on the CLIPPER Module female connector may not correspond 
. to numbering on user-supplied male connectors. 
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Figure 71 CLIPPER C300 Module C300C3DLX with Dual Connector 








F 
3 
220 88 
3 


NOTE: 
Package dimensions are given in inches. 
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Figure 72 Pinout of CLIPPER C300 Module C300C3DLX Dual Connector 


TOP VIEW OF MODULE 
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